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ABSTRACT 

The effects of sampling are investigated on measurements of counts-in-cells in three- 
dimensional magnitude limited galaxy surveys, with emphasis on moments of the 
underlying smooth galaxy density field convolved with a spherical window. A new 
estimator is proposed for measuring the fc-th order moment (p k ): the weighted factorial 
moment [w] ■ Since these statistics are corrected for the effects of the varying selection 
function, they can extract the moments in one pass without the need of constructing 
a series of volume limited samples. 

The cosmic error on the measurement of -Ffe[<x>] is computed via the the formal- 
ism of Szapudi & Colombi (1996), which is generalized to include the effects of the 
selection function. The integral equation for finding the minimum variance weight is 
solved numerically, and an accurate and intuitive analytical approximation is derived 
WoptimaiM K l/^Mi where A(r) is the cosmic error as a function of the distance from 
the observer. The resulting estimator is more accurate than the traditional method 
of counts-in-cells in volume limited samples, which discards useful information. As a 
practical example, it is demonstrated that, unless unforeseen systematics will prevent 
it, the proposed method will extract moments of the galaxy distribution in the future 
Sloan Digital Sky Survey (hereafter SDSS) with accuracy of order few percent for 
k = 2, 3 and better than 10% for k = 4 in the scale range of 1 h^ 1 Mpc < I < 50 h^ 1 
Mpc. In the particular case of the SDSS, a homogeneous (spatial) weight uj = 1 is 
reasonably close to optimal. 

Optimal sampling strategies for designing magnitude limited redshift surveys are 
investigated as well. The arguments of Kaiser (1986) are extended to higher order 
moments, and it is found that the optimal strategy depends greatly on the statistics 
and scales considered. A sampling rate / ~ 1/3 — 1/10 is appropriate to measure low- 
order moments with k < 4 in the scale range 1 h^ 1 < I <> 50 hr 1 Mpc. However, the 
optimal sampling rate increases with k, the order considered, and with l/£. Therefore 
count-in-cells statistics in general, such as the shape of the distribution function, high 
order moments, cluster selection, etc., require full sampling, especially at small, highly 
nonlinear scales i ~ 1 h^ 1 Mpc. 

Another design issue is the optimal geometry of a catalog, when it covers only a 
small fraction of the sky. Similarly as Kaiser (1996), we find that a survey composed 
of several compact subsamples of angular size Qf spread over the sky on a glass-like 
structure would do better, with regards to the cosmic error, than the compact or the 
traditional slice like configurations, at least at small scales. The required dynamic 
range of the measurements determines the characteristic size of the subsamples. It is 
however difficult to estimate, since an accurate and cumbersome calculation of edge 
effects would be required at scales comparable to the size of a subsample. 

Key words: large scale structure of the universe - galaxies: clustering - methods: 
numerical - methods: statistical 
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1 INTRODUCTION 

The large scale structure of the Universe is generally ad- 
mitted to be homogeneous at scales above ~ 150 Mpc. At 
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smaller scales, observations of the galaxy distribution show 
a remarkable clustering as evidenced by voids, clusters, fil- 
aments, and superclusters. According to standard theories, 
these structures grew from small initial fluctuations under 
the influence of gravity. This, together with possible bias- 
ing, resulted in the random point process represented by 
galaxies. Thus, statistical methods can be applied efficiently 
to galaxy surveys to constrain models of large scale struc- 
ture formation. Once the statistical tool is selected, two im- 
portant questions influence its applicability. First, an opti- 
mal sampling strategy can be used to build a galaxy cata- 
log (e.g. Kaiser f986, hereafter K86), thus maximizing the 
information content with respect to the statistics used. Sec- 
ond, an optimal measurement method can be used to extract 
the maximum amount of information present in the catalog. 
The aim of this work is to address both of these questions 
in a quantitative way, focusing on low order moments of the 
probability distribution function (PDF) of the large scale 
galaxy density field. To achieve these goals, a number of 
plausible, nevertheless important assumptions were made, 
which are described next. 

The galaxy distribution is assumed to be a discrete, lo- 
cally Poissonian realization of an underlying smooth random 
field. To estimate the moments of the PDF of this random 
field, factorial moments (e.g. Szapudi & Szalay f993a) of 
the count probability distribution function Pn(£) (CPDF) 
are used. By definition, the CPDF represents the proba- 
bility of finding N galaxies in a spherical (circular) cell of 
radius t thrown at random in a three-dimensional (two- 
dimensional) galaxy catalog. The CPDF is easy to mea- 
sure and widely used to test the scaling behavior of galaxy 
catalogs (see e.g. Alimi, Blanchard & Schaeffer f990; Mau- 
rogordato, Schaeffer & da Costa f992; Szapudi, Szalay & 
Bochan f992; Bouchet et al. f993; Gaztanaga f992, 1994; 
Szapudi, Meiksin & Nichol 1996, Szapudi & Szalay 1997a) 
and iV-body simulations data sets (see e.g. Bouchet, Scha- 
effer & Davis 1991; Bouchet & Hernquist 1992; Baugh, 
Gaztanaga & Efstathiou 1995; Gaztanaga & Baugh 1995; 
Colombi, Bouchet & Hernquist 1996). 

Throughout this work, we assume an ideal three- 
dimensional magnitude limited galaxy catalog £, of depth 
-Rmax, with magnitude limit Mn m , covering a given volume 
V of the universe, and containing a number iV bj of spher- 
ical coordinates of galaxies, (z,0,(j>), z being the measured 
redshift. 

A purely statistical approach is used: except for the ef- 
fects of the selection function in a magnitude limited sample, 
all observational effects are ignored, such as extinction, con- 
fusion limit, and systematic errors due to the imperfection 
of the instruments. 

Redshifts are considered as pure distances, i.e. effects of 
projection in redshift space are neglected. Such effects can 
significantly change the behavior of the CPDF, especially in 
the nonlinear regime (e.g. Kaiser 1987; Lahav et al. 1993; 
Matsubara & Suto 1994; Hivon et al. 1995). 

Finally, it is assumed that the clustering of galaxies does 
not depend significantly on their luminosity, which is proba- 
bly a crude approximation. Indeed, there are both theoreti- 
cal (e.g. White et al. 1987; Mo & White 1996; Valls-Gabaud, 
Alimi & Blanchard 1989; Bernardeau & Schaeffer 1992, here- 
after BeS) and observational arguments (e.g. Hamilton 1988; 
Davis et al. 1988; Dominguez-Tenreiro & Martinez 1989; 



Benoist et al. 1996) suggesting that the level of clustering of 
galaxies increases with their luminosity. 

In realistic redshift surveys, the average number den- 
sity of galaxies n(r) changes with the distance r from the 
observer. The CPDF is traditionally defined in an homo- 
geneous catalog, i.e. with constant n(r). One way to bypass 
this problem is extraction of volume- limited subsamples £y L 
of depth Ri < -R m ax from the main catalog £ (see for exam- 
ple Maurogordato et al. 1992; Bouchet et al. 1993). In these 
subsamples, the objects are such that their apparent magni- 
tude at distance r — Ri would be larger than Mn m . Such a 
selection criterion renders the number density of galaxies in 
the catalog independent of distance, at the price of signifi- 
cant information loss. Although this can be partially recov- 
ered by cutting several volume-limited catalogs with various 
values of Ri, it would be preferable to extract all the avail- 
able information from the catalog in a single measurement. 
This is possible by defining an inhomogeneous counts-in-cells 
measure which is corrected for the variation of n(r) with the 
distance from the observer, similarly to, e.g., Efstathiou et 
al. 1990 and Szapudi & Szalay 1996. This, combined with a 
minimum variance weighting, results in unbiased, selection 
corrected estimators of the iV-th factorial moment of the 
galaxy counts. To clarify the substantial gain from such an 
approach, we carefully calculate the errors on the measure- 
ments, and show how our method minimizes them. 

Only statistical errors are considered, caused by the fact 
that only a finite part of the universe is accessible for ob- 
servations. The corresponding cosmic error was calculated 
by Szapudi & Colombi (1996, hereafter SC) for count-in- 
cells measurements in an homogeneous catalog. The differ- 
ent contributions were classified as follows: 



(i) The finite volume error is due to fluctuations of the 
underlying random field at wavelengths larger than the size 
of the catalog. The finiteness of the sampled volume causes 
systematic effects on the measurement of the CPDF and its 
moments, even in iV-body simulations (Colombi, Bouchet & 
Schaeffer 1994, 1995, hereafter CBSI and CBSII). 

(ii) Edge effects are related to the geometry of the cat- 
alog: the galaxies near the edges of the catalog receive less 
statistical weight than those far from the boundaries. Edge 
effects can be theoretically corrected for, at least partially. 
A corrected estimator can be used for the two-point corre- 
lation function £(r) (see, e.g. Ripley 1988, p. 22; Landy & 
Szalay 1992) from which the variance of the PDF, can 
be obtained as a double integral of £ over a cell of radius i. A 
class of edge corrected estimators was actually recently in- 
troduced by Szapudi & Szalay (1997b) for the higher order 
moments as well. They, however, use a somewhat compli- 
cated procedure which is applicable only to the moments of 
fluctuations. 

(iii) The shot noise error (or, equivalently, the error from 
discreteness effects) is due to the incomplete sampling of 
the underlying smooth field with a finite number of points. 
In particular, excessive undersampling causes degeneracy of 
the CPDF, rendering the confrontation of various models 
against observations difficult (e.g. Bouchet et al. 1993, CB- 
SII). By definition, discreteness effects tend to zero as the 
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average number density of objects in the catalog approaches 
infinity^. 

While unbiased estimators can be constructed in a num- 
ber of ways, the above sources of errors can be reduced by 
giving an appropriate statistical weight depending on the 
region in the catalog. In a magnitude limited catalog for 
instance, density decreases far away from the observer, re- 
sulting in an increase in the shot noise. This alone would call 
for increasing statistical weight close to the observer, thus 
decreasing the contribution of distant portion of the survey. 
However, this reduces the effective sampled volume, and in- 
creases the finite volume and the edge contributions to the 
error. There exists a "minimum variance" weighting which 
provides a compromise between various effects. By deriving 
and solving the integral equation for the minimum variance 
weighting scheme, we show that a significant gain in accu- 
racy on the measurement can be achieved with our method, 
compared to the traditional volume-limited approach. Note 
that the word "optimal" is used synonymously with mini- 
mum variance in this paper, i.e. producing the smallest error 
in a class of unbiased estimators. Although this usage does 
not emphasize it, it is possible in principle to find even bet- 
ter estimators by extending the class in which the search is 
performed. 

While the above procedure can optimize the way infor- 
mation is retrieved from an existing survey, calculation of 
cosmic errors helps in designing optimal surveys. In partic- 
ular, an optimal sampling strategy can be found to build a 
three dimensional galaxy catalog for a given statistical indi- 
cator. Given the available telescope time, K86 found the op- 
timal strategy for the measurement of the two-point correla- 
tion function of galaxies, using a simple model for the cosmic 
error. To reduce finite volume and edge effects, which are in- 
dependent of the number of objects in the catalog, a sparse 
survey with large volume is needed. To reduce discreteness 
effects, on the other hand, a catalog as dense as possible 
is preferable. These competing effects determine the opti- 
mal sampling strategy with respect to a particular statistic. 
For the two-point function, K86 concluded in favor of sparse 
samples with approximately 1/10 to 1/20 of the candidates 
randomly selected for measuring their redshifts. Sparse sam- 
pling strategies were in fact used for several surveys, such as 
the Stromlo APM redshift survey (e.g. Loveday et al. 1992) , 
the QDOT redshift survey (e.g. Moore et al. 1994), and the 
Durham/UKST redshift survey (e.g. Ratcliffe et al. 1997). 
However, the advent of multi-fiber spectroscopy makes large, 
complete galaxy surveys possible in a significantly shorter 
time than when K86 proposed his idea. Prime examples are 
the Sloan Digital Sky Survey (SDSS, e.g. Loveday 1996), and 
the 2DF Survey (e.g. Lahav 1996). While the problem of op- 
timal sampling strategies might loose from its relevancy with 
these new developments, it is still worth to study it: either to 
facilitate preliminary investigations to prepare large surveys, 
or optimize the design of surveys aimed at statistical proper- 
ties of rarer or harder to find objects at various wavelengths. 
Indeed, it will be shown how sparse sampling strategies are 

* Note however that part of the shot noise belongs to the subclass 
of "edge-discreteness" effects, i.e. it is also an edge effect, and can 
be corrected for with appropriate techniques (Szapudi & Szalay 
1997b). 



relevant for measuring low order moments in a reasonable 
scale range, while high order statistics, direct analysis of the 
CPDF shape, cluster selection, etc., are highly sensitive to 
discreteness effects, therefore proving that full sampling is 
more advantageous. 

Another important design issue, when the catalog has 
poor sky coverage, is its geometry (e.g. Kaiser 1996, here- 
after K96). This affects finite volume, edge, and shot noise 
effects in different ways. Although a general solution to this 
complicated problem will not be given, it will be discussed 
in broad terms, with suggestions for reasonable design prin- 
ciples. The solution for the conceptually simplest case will 
be given, which is relevant even for surveys which will even- 
tually cover a continuous portion of the sky. The results 
presented here can facilitate the extraction of preliminary 
results before the catalog is finished. 

This article is organized as follows. § 2 introduces the 
statistical indicators used along this paper, i.e. the weighted 
factorial moments corrected for selection effects. § 3 con- 
tains the calculation of the general expression for the cos- 
mic error, extending the results of SC to a magnitude lim- 
ited catalog with a spatial weight. The integral equation 
for the optimal weight is given. Some simple but still quite 
general assumptions on the underlying statistics are made 
to enable the numerical calculation of the optimal weight 
and the corresponding cosmic error. § 4 elaborates a prac- 
tical example: a SDSS-like catalog. In particular, a useful 
approximation for the optimal weight is found. We empha- 
size the advantages of extracting the information from the 
data with the optimal weight, compared to the traditional 
volume limited approach. In § 5, sparse sampling strategies 
are studied along with the suitable choice of geometry for the 
catalog. § 6 summarizes and discusses the results. In addi- 
tion, three Appendices provide further information. § A con- 
tains mathematical details concerning the calculation of the 
cosmic error. § B explains and tests the numerical method 
used to estimate the optimal weight. § C contains mathe- 
matical formulae used to derive analytical expressions for 
the optimal sampling rate in some asymptotic regimes. 



2 WEIGHTED FACTORIAL MOMENTS 

CORRECTED FOR SELECTION EFFECTS 

For a statistically homogeneous catalog, the CPDF Pn is 
the probability of finding N galaxies in a cell of size I. The 
factorial moments of counts in cells are defined for k > by 

F k (l) = ((N) k ) = J2( N )k P »> (!) 

JV 

where the falling factorials are defined as (N)k = N(N — 
1) . . . (N-k+1), and (N) = 1, therefore F = 1. Under the 
assumption of infinitesimal Poisson sampling (Peebles 1980, 
p. 147) the factorial moments correct directly for the dis- 
crete nature of galaxy catalogs. More precisely, their ensem- 
ble averages are equal to the moments of the (appropriately 
normalized) underlying smooth density field p ((p) = 1), 

F k = N k (p k ), (2) 

where N = (N) = F\ is the average number of objects per 
cell (see Szapudi & Szalay 1993a). 
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Counts-in-cells statistics are estimated by a large num- 
ber of sampling cells positioned at random in the catalog. 
For a homogeneous survey, the corresponding estimator of 
the factorial moment is expressed by the counts Ni in the 
ith cell, 1< i < C, 



c 

ST, 



(3) 



Equivalently, the CPDF can be estimated first (e.g. SC), 
which gives the above estimator through equation (|l|). The 
form (S) is meaningful if all the cells are equivalent, i.e. if 
they all have the same statistical weight u = 1. In an at- 
tempt to correct for finite volume and edge effects, a weight 
W£,fc(rj) (to be determined later) can be assigned to each 
cell i. This can depend on the position and size of the cell, 
and the statistic at hand: 



c 



(4) 



The weight uji^ is determined by minimizing the value of 
the cosmic error under the constraint of appropriate nor- 
malization. 

A magnitude limited catalog £ , such as defined in in- 
troduction, is inhomogeneous since the selection function is 
not uniform. If n is the real number density of the underly- 
ing galaxy distribution (assuming that such number is well 
defined), the effective number density n(r) of galaxies in a 
thin shell at distance r from the observer reads 



n(r) — n<p(r), 



(•») 



where <j> is the selection function. In particular, the average 
number of galaxies in a cell of radius £ at distance r from 
the observer is given by 



4 3 

N r = n e (r)v, v = -n£ , 

with 



m(r) = n<f>e(r), 



(6) 



(7) 



and 4'e{ r ) ' s the average of the selection function over a 
cell. The approximation cj>e ~ cj> is excellent for the relevant 
scales, valid within a few percents at worst. It will be used 
for practical calculations throughout. 

To correct for selection effects we follow Szapudi & Sza- 
lay (1996) by changing equation M) in 



c 

c ^ 



(Ni) k mi ik (ri) 
[Mn)] k ' 



(8) 



This is the final form we propose for the estimator of the 
factorial moments. Note that a priori knowledge of the se- 
lection function <f>(r) is assumed. Although implicitly always 
present, the k and £ dependence of the weights ui( t k will be 
usually omitted. The normalization of the weights follows 
from the requirement that the estimator is unbiased. Taking 
the ensemble average of the above equation, and averaging 
all possible random realizations of C cells, gives 



1 



V{1) 



d 3 rui(r) = 1, 



(9) 



where V{£) is the effective sampled volume, i.e. the volume 
occupied by the center of all possible cells of radius I con- 
tained in the catalog. While the proposed estimator can be 
used directly to estimate the factorial moments, it might 
be advantageous to introduce the inhomogeneous CPDF, 
Pjv(r). Once this is estimated, the inhomogeneous factorial 
moments Ft(r) can be calculated. Using the scaling (f>(r)~ k 
and summing over with the appropriate weights ui(r) is iden- 
tical to the proposed estimator. This way, however, any high 
order moment can be calculated using the inhomogeneous 
CPDF without the need of rescanning the whole catalog. 



3 COSMIC ERROR AND OPTIMAL WEIGHT 

This section generalizes the formalism for computing the 
cosmic error presented by SC to the case of inhomogeneous 
selection function and weight. The equation for the optimal 
weight is derived in § 3.1. The locally Poissonian and hier- 
archical assumptions are used to simplify the calculations in 
§ 3.2. A few comments follow on the interpretation of the 
results thus far (§ 3.3), and, finally, it is shown how this 
formalism can be applied to practical measurements (§ 3.4). 



3.1 Formalism 

The variance of F k is defined by 

(a^) 2 m<^)) c -«^>>: 



> 2 

'c : 



(10) 



where ( ), and ( ) c are the ensemble average, and averag- 
ing over all possible sets of sampling cells, respectively (we 
dropped the C dependence of F^). For the proposed estima- 
tor 



i 1=1 / c 

' 1 , x , v <(M)*(^i)*> \ 

^g" (r<Mrj? W^(rJ]* / ' 



(11) 



Following SC, the evaluation of this expression in terms of 
the parameters of the distribution is facilitated by gener- 
ating functions. Let us introduce the generating function, 
P r (x), of the probability Pzv(V) of finding iV objects in a 
cell of size £ at position r, where the effective average num- 
ber density is ne(r): 



P r (x) EE ^PNir). 



(12) 



Similarly, we define Pri,rAx,y) as the generating function 
of the bivariate probability PN,M{fi,rj) of finding N and 
M galaxies in cells of radius £ respectively at positions ri 
and r,: 



p r,,rA x 'y) 



N,M=0 



N M t, , n 

x y PN,M{ri,rj) 



As SC, we write formally 



(13) 



(14) 



x— y— 
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The generating function of the total error, E c ' v (x,y), is 
asymptotically the sum of two generating functions 



E'- 



\x, y) = (l - 1) E°°- V (x, y) + E c ^(x, y). (15) 



Function E°°' (x,y) generates the errors for hypothetical 
surveys with finite volume V and infinite number of sam- 
pling cells: 



E°°' V (x,y) 



d 3 r 1 d 3 r 2 uj(r 1 )uj(r2)[(t>(ri)<j>(r2)] k 



{P ri ,r 2 (x,y)-P ri (x)P r2 (y)}. (16) 

Function E c '°° (x, y) generates the errors due the finite num- 
ber of sampling cells used to do the measurement!!: 



E^°°(x,y)= £ J ^(r)^)]-" Pr(xy) 



V 2 



d 3 ruj(r)[(f>(r)]- k P r (x) 



d 3 ru>(r)[</>(r)}- k P r (y) 



(17) 



Note that the generating function defined above can only be 
used for the error on the fe-th order moment, i.e. for each 
k a slightly different generating function must be used. The 
reason for this is the implicit k dependence of the weight u) 
(the scaling with the selection function could be taken into 
account simply with the substitution x —> cjjx — <f> + 1). Al- 
though it would be possible to define a single but three vari- 
ate generating function for all orders, it is simpler and more 
practical to use the above definition. Also, for the sake of 
completeness, we quoted both the cosmic error E°°' v (x, y), 
and the measurement error E c '°° (x,y). The former is an in- 
herent property of the survey, while the latter is related to 
the finite number of sampling cells C used to do the mea- 
surement. Since this contribution can be rendered arbitrarily 
small with massive "oversampling" , C — > oo, or with an al- 
gorithm corresponding to C — oo (Szapudi 1997) it will not 
be considered further (see SC for a discussion on the "num- 
ber of statistically independent cells", i.e. the number of 
cells needed to extract all the relevant information from the 
catalog). In what follows, the upper indices will be dropped 
from the cosmic error for simplicity, E = E c ' v ~ _E°°' V . 

The above formalism provides the framework to com- 
pute the optimal sampling weight, which minimizes the cos- 
mic error on the measurement of Fk. Taking into account 
the normalization of the weights leads to the following La- 
grangian 



+ 2Ai i / d 3 ruo(r) - 1 
\vJv 



(18) 



t If the weight is homogeneous and if there are no selection ef- 
fects [l> = <t> = 1), as in SC, this function does not depend on the 
survey volume. In that case, it generates the errors for hypotheti- 
cal surveys with infinite volume (V = oo) and finite number C of 
sampling cells. This explains the formal notation " E c (x , y)" . 



The optimal weight to is thus the solution of the following 
integral equation 

{Pr,u(x + 1,1/ + 1) " Pr(x + l)Pu(y + 1)}| B=8=0 

+ A = 0. (19) 

The constant A is determined by the normalization (^) . 



3.2 Approximations 

To compute the optimal weight for measuring Fk and the 
corresponding cosmic error, generating functions Pr,u{x,y) 
and P r (x) are needed. More precisely, after partial differen- 
tiation of order k at x — y — in equations (h4j) , the factors 
Fi im {l, r, u) are needed, which are defined as 



F l , m (e,r,u) = ((N) l (M)J 



(20) 



up to order l + m — 2k. These quantities generalize the con- 
cept of factorial moments to bivariate distributions (Szapudi 
et al. 1995). In equation @, the ensemble average is taken 
over two cells of size £ at positions r and u in the catalog, 
which can possibly overlap. This especially complicates the 
problem of finding the optimal weight for measuring Fk and 
the corresponding cosmic error. 

Following SC, one can make reasonable assumptions 
about the underlying statistics to simplify the calculations 
considerably: the hierarchical tree model, and the locally 
Poissonian approximation. Then, as shown below, only the 
factorial moments Ft, I < 2k and the two point-function 
£(r) are required a priori for the computation of the opti- 
mal weight and the corresponding cosmic error on Fk- The 
calculations are detailed in Appendix A. We summarize here 
the important steps and hypotheses: 



(i) The integral fllq ) is split into two parts, according to 
whether or not the two cells overlap. 

(ii) The calculation of the non-overlapping part requires 
the knowledge of the bivariate count-in-cells generating 
function for disjoint cells. Following SC, we simplify it by 
taking two particular but still quite general cases of the hi- 
erarchical model (e.g. Peebles 1980, p. 206 & 211, Balian & 
Schaeffer 1989, hereafter BS) by SS and BeS (see Appendix 
A for more details). The hierarchical model is seen to be a 
good approximation for the higher order statistics in the ob- 
served galaxy distribution (e.g. Groth & Peebles 1977; Fry 
& Peebles 1978; Sharp, Bonometto & Lucchin 1984; Sza- 
pudi et al. 1992; Meiksin, Szapudi & Szalay 1992; Szapudi 
et al. 1995; Szapudi & Szalay 1997a) and in iV-body sim- 
ulations (e.g. Efstathiou et al. 1988; Bouchet et al. 1991 
Bouchet & Hernquist 1992; Fry, Melott & Shandarin 1993 
Bromley 1994; Lucchin et al. 1994; CBSI, CBSII; Colombi 
et al. 1996). 

The function Pr 1: r 2 ( x y y) i s then Taylor-expanded to first 
order in £{t 12) /(,{£), where f(r) is the two-point correlation 
function, r\ 2 = \r\ — V2\ and 



(21) 



This approximation is becoming more accurate when the 
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cells are far away from each other, however, it is still reason- 
able even when the cells touch each other (e.g. Bernardeau 
1996). 

(iii) To compute the overlapping contribution, the vari- 
ations of the weight and of the selection function are ne- 
glected within the cells. We also assume local Poisson be- 
havior, which considerably simplifies the writing of the bi- 
variate generating function of counts for overlapping cells 
(see SC). 

SC carried out explicitly the calculation to leading or- 
der in v/V for uniform survey and constant weight. They 
found that the cosmic error could be separated into three 
contributions 



A 2 [^ = 1,0=1] = 



/ AF k 



V F k 



= Ap + A| + Ad. 



(22) 



The term A F is the finite volume error discussed in 
introduction, arising from the contribution of disjoint cells 
in equation ([l6]). It is proportional to the integral of the 
two-point function over the sampled volume 



d 3 nd 3 r 2 £(r 12 ) 



(23) 



(L ~ V 1 ^ 3 ). While it depends on the clustering properties 
of the system, i.e., 



{F l /N l ,l<2k} f(L), 



(24) 



it is independent of the average number density n [see 
eq. (0)]. Even if the number of objects in the catalog in- 
creases, the finite volume error remains unchanged. 
Similarly, the edge error A| formally written as 



{fi/N*,1 < 2k\ lv/V, 



(25) 



is independent of average number density as well. Edge ef- 
fects are caused by uneven weighting near the edges of the 
catalogue, thus increasing with v/V. 

Finally, the discreteness (shot noise) error A D can for- 
mally be written as 

A D = JFd {Fi , I < 2k} v/V. (26) 

This is the only contribution to the cosmic error which de- 
pends on average number density n. To leading order in N, it 
is proportional to N v/V thus becomes important at small 
scales or when the number of objects iV bj in the catalog is 
small. 

SC have tested successfully the validity of the above 
approximations by computing the cosmic error in artificial 
galaxy catalogs generated by Rayleigh-Levy random walks, 
for which the clustering properties could be calculated ex- 
actly (e.g. CBSII). 

When u 1 and <j>(r) < 1, equation ( [22] ) generalizes to 
(see Appendix A) 

ALmick^] A|M + A|M + Al[u>,4>], (27) 



with 



ApM = 



Z(L)V 2 Jv 



d 3 rid 3 r 2 w(ri)cj(r 2 )£(ri 2 ), 



(28) 



(29) 



V 



d 3 ru 2 (r)Al(r). 



(30) 



The r dependence of A D in the integral (^o|) is caused by 
the selection function, as this type of error depends on the 
average count. On the other hand, the selection effects nat- 
urally canceled out in the finite volume and the edge errors, 
as expected. 

The integral equation for the optimal weight becomes 

/ d 3 uLu(u)a\r - tt|) + {A| + Ao(r)} uj(r) 
Jv 

+ A = 0, (31) 



t(L)V 



and the constant A is determined by the normalization 
This standard equation can be solved numerically. 



3.3 Interpretation 

From equation (^) , the following immediate conclusions can 
be drawn on how different contributions to the cosmic error 
influence the optimal weight: 

(i) If edge effects were dominant, the optimal weight 
would simply be uniform. This is contrary to intuition sug- 
gesting increasing weight at the edges to compensate for the 
lesser statistical weight carried by these regions of the cata- 
log. This is how edge effects are corrected for the two-point 
function f(r) (e.g. Ripley 1988, p. 22). The finite extension 
of the cells, however, prevents us from correcting for edge 
effects with our indicator ^ , which uses a simple multiplica- 
tive sampling weight. More involved additive correction for 
the moments of the fluctuations of counts in cells will be ex- 
plained elsewhere using the formalism outlined in Szapudi 
& Szalay 1997b. 

(ii) If discreteness effects were dominant, the correct 
weight approximately would be 



w(r) cx 1/A£(r), 



(32) 



in agreement with intuition. Indeed, the effective number 
density n(r) decreases with increasing distance from the ob- 
server, thus the shot noise error increases as well. This is 
compensated by u decreasing with r. 

(iii) Finite volume effects are difficult to predict without 
explicit calculations. The correlation function £(r) is ex- 
pected to follow approximately a power-law behavior from 
observations (Totsuji & Kihara 1969; Peebles 1974; Davis & 
Peebles 1983) 



«->-(s) 



ro — 5 h 1 Mpc, 7 



(33) 



for 0.1 h~ x Mpc <z r ^ 10 h' 1 Mpc. At larger scales it 
decreases rapidly with scale, becoming negative at scales 
around ~ 30 - 100 h' 1 Mpc (e.g. Fisher et al. 1994; Tucker 
et al. 1996) , although this turnaround value is presently un- 
certain. After that, it is expected to oscillate slowly around 
zero with very small amplitude. With such a behavior, it is 
not obvious to predict the optimal weight without explicit 
numerical calculations. The result will depend on the size 
and the geometry of the catalog. The only simplification 
is that if the finite volume error were dominant, the corre- 
sponding optimal weight would be independent of the order 
k. 
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3.4 Practical Measurements 

As shown in § 3.2, even though it was considerably simplified 
with reasonable hypotheses, the calculation of the optimal 
weight and the cosmic error for Fk needs prior knowledge 
of Fi, I < 2k (including Fk itself !) and of £(r) (and of 
course of the selection function). As a result, we propose two 
procedures to perform a practical measurement in a galaxy 
catalog: 

(i) One possibility is to choose a model of large scale 
structure with given values of Fi, I < 2k and £(r). These 
values are used as input parameters in equation ( |3l| ) to com- 
pute the optimal weight and the corresponding theoretical 
cosmic error [eq. (p7|)]. The value of Fk measured with this 
weight can be compared to the theoretical one, given the 
theoretical cosmic error. This procedure should be applied 
again to each competing model. 

(ii) An alternative iterative approach starts with measur- 
ing directly the values of Fi, I < 2k in the galaxy catalog, 
with a given weight, for example ui = 1. With these val- 
ues of Fi , one would solve equation (^T|) to find the optimal 
weight for measuring Ft, and thus perforin a more accu- 
rate measurement. This can be repeated until convergence 
is achieved. We conjecture that a small number of iterations 
should be sufficient in practice. The main weakness of this 
model independent approach is that there is a cosmic error 
on the determination of the optimal weight itself. As dis- 
cussed in SC (and earlier by CBSI and CBSII), the cosmic 
error is likely to be systematic, which implies that the opti- 
mal weight estimated this way might be biased. This bias, of 
course, would only increase the cosmic error on the measure- 
ment of Ft,: the estimator for the moments is still unbiased 
by definition. 



4 EXAMPLE: THE SDSS CATALOG 

The future SDSS will be a likely proving ground of the meth- 
ods proposed in this paper. Therefore a "SDSS-like" catalog, 
£ , is used to illustrate the applicability of the theory outlined 
so far. This section is organized as follows. § 4.1 presents the 
properties of the hypothetical survey considered, i.e., its lu- 
minosity function, geometry, and the underlying statistics 
(function £(r), factorial moments Fk). Then, in § 4.2, the 
optimal weight is computed with the corresponding cosmic 
error. A simple approximation for u}(r) is found which prac- 
tically minimizes the cosmic error and avoids solving nu- 
merically integral equation (|3l|). Finally, in § 4.3, we show 
the advantages of our optimal strategy of using the full cata- 
log compared to the alternative of extracting volume limited 
subsamples. Details on the method used to solve numerically 
equation fell) are given in Appendix B. 



-1.07, 



: 0.0156h 3 Mpc" 3 , 



4.1 Properties of the Survey 

The luminosity function of the catalog £ is assumed to be 
of the Schechter form (Schechter 1976) 



<p(L/L,) = ML/L,) a exp(-L/L,), 



(34) 



with parameters taken from Efstathiou, Ellis & Peterson 
(1988, see also Efstathiou 1996) 



(35) 

where h represents the uncertainty of a factor two on the 
Hubble constant: Ho = 100 h km/s/Mpc. In what follows, 
we shall use h = 0.5, i.e. 



Ho = 50 km/s/Mpc. 



(36) 



However, the results derived hereafter should not depend 
significantly on the value of Ho- In fact, the Hubble con- 
stant influences the shape of the power-spectrum of initial 
fluctuations through the parameter F = Q.h 2 , where Q is the 
density parameter of the universe (e.g. Efstathiou, Bond & 
White 1992). 

The average number density in a thin shell at distance 
r from the observer is 



n(r) = <f>*r[a + l,L Um (r)/L,], 



(37) 



where Lij m (r) is the minimum required luminosity for a 
galaxy at distance r from the observer to be included in 
the catalog, and F is the incomplete gamma function. If K- 
correction is neglected, 



L lim (r)/L. = l 0.4(A/,-M llm+ 25+51o gl0 ^ 



(38) 



In the above equation, r is expressed in Mpc. Our choice of 
M* and the magnitude limit Mn m is 



-19.68 + 51og(f/o/100), M Um = 18.3. 



(39) 



Note that, with the above value of a, n(r) diverges at r — 0, 
formally implying that the average number density of the 
real, total galaxy distribution is infinite. This, however, does 
not affect the following calculations. 

For the geometry, we assume that the catalog covers a 
cone of depth 



iCax = 1200 Mpc 



(40) 



with elliptic basis defined as follows in Cartesian coordi- 
nates: 

2 / „, \ 2 

(41) 



( * y+M-v 

V tan 9m ) V tan 9 m J 



65°, m = 55°. 



(42) 



With the above choice of the parameters the catalog would 
typically contain 



N, 



obj 



830000 



(43) 



objects. 

The underlying statistics is chosen as follows: 



(i) The two-point correlation function is an estimate of 
the nonlinear matter autocorrelation function for the stan- 
dard Cold Dark Matter (CDM) model, which actually con- 
tains the Ho dependence of the subsequent calculations. It 
is computed by Fourier transform of the power-spectrum 
obtained from the nonlinear ansatz of Peacock & Dodds 
(1994). The normalization is chosen such that the variance 
in a sphere of radius 8 ft -1 Mpc is unity. The choice of the 
two-point function fixes its average over a cell £ and there- 
fore the factorial moment of order 2. 

(ii) As the optimal weights for the measurement of Fk, 
k < 4 will be discussed, prior knowledge of the statistics up 
to k — 8 is needed. Factorial moments Fk, 3 < k < 8 are 
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derived from the measurements of Gaztanaga (1994) of the 
coefficients Qn- 



Q s = 1.35, Q 4 

Qe = 6.7, 
By definition, 

Q 1 



T N v f 



= 2.33, Q 5 = 4.02, 
Q 7 = 10, Qs = 12. 

d 3 ri . . . d 3 rjv£]v(ri, . . . , r N ), 



(44) 



(45) 



where £jv is the iV-point correlation function (see e.g. Pee- 
bles 1980, p. 138; BS) and 



(46) 



In general, Qn can depend on scale. We assume that the hi- 
erarchical model applies, which implies that Qn = constant 
independent of scale. However, the cosmic errors should be 
fairly robust against small variations of the cumulants. 
The generating function of Pjv [eq. (|l^)] can be expressed 

as 



P(x) = exp 



5> 



1) N TnQn 



(47) 



This is a completely general equation, true outside of the 
hierarchical model framework as well (BS; Szapudi & Szalay 
1993a). 

The factorial moments Fk can be computed from the co- 
efficients Qn, N and £ through Fk = [d/dx] k P(x)\ x =i [see 
eq. §]. 

The volume limited subsamples £y L extracted from our 
virtual catalog £ are of depths R4 = 200, 400, 600, 800, 1000 
and 1200 Mpc. Typically, they are expected to contain re- 
spectively N ohj ,i ~ 40100, 138 500, 208 800, 211 800, 164 900 
and 104 200 objects. 



4.2 The optimal weight and the corresponding 
cosmic error 

The assumed catalog has a non-spherical geometry similar 
to the future SDSS. Therefore the weight should depend 
both on the angles and the distance from the observer r, 
except when the finite volume error contribution is negligi- 
ble. For simplicity, however, only the radial direction will 
be considered, i.e. the cosmic error will be minimized in the 
subspace of functions w(r). After integration over the angles, 
the cosmic error becomes 



A|M + A|M + A^,0], 



(48) 



with 



Ae[w] 



Afjk 



r\dr\r\dri 



Cl{ri)u(ri)Cl(ra)cj(r2)i(ri,n), 
— T— / r drQ(r)ui (r), 



V 



r dr£l(r)ui (r)A D (r) 



(49) 
(50) 

(51) 




500 

R (Mpc) 



1000 



Figure 1. The optimal weight ui for measuring the factorial mo- 
ment Fk{£) (k > 1) is plotted as a function of distance R from 
the observer in the case the finite volume error is dominant. The 
results displayed here correspond to I = 10 Mpc; other values 
of I are similar. The solid curves correspond to various volume 
limited subsamples £y L extracted from our SDSS like catalog £ 
(see § 4.1). The depth Ri of the subsamples increases with the 
^-coordinate of the right end point of the curves: respectively 
Ri = 200, 600, 1000 and 1200 Mpc. The latter case is valid for 
the parent sample S as well. The dashed curve corresponds to a 
catalog exactly the same as £ but covering the full sky. The dot- 
ted curve is the same, but the two-point function is assumed to 
be a power-law £(£) = (£/18) -1 ' 8 over all the available dynamic 



where Cl(r) is the solid angle covered by cells at distance r 
from the observer, -R m in, flmai denote the distance of the 
closest and furthest cell to the observer, and 



1 



f2(n)n(r 2 ) Jfi(n) Jn(r 2 ) 

smdidOidtpi sin#2d#2d</'2£([ r i + r\ 

— 2rir2(cos(<^i — Lp 2 ) sin#i sin 62 

+ cos9 1 cos 6> 2 )] 1/2 ). 

The optimal radial weight is the solution of the following 
integral equation (provided that a solution exists), 



(52) 



u duCl(u)u)(u)^(r,u) 



+ {Al + A^(r)}a;(r) + A: 



0. 



(53) 



Figure |lj shows the optimal weight given by the 
numerical solution of equation (|H^) in various situations (see 
Appendix B for details of the numerical method). The fi- 
nite volume error contribution is assumed to be dominant 
for this plot. In this case, the optimal weight is indepen- 
dent of the statistical object under study, i.e., for Fk, of 
the order k. The long dashes correspond to a catalog with 
similar characteristics as our SDSS-like catalog, but cover- 
ing the full sky, in order to have the exact solution u)(r) 
only depending on r. The dots correspond to the same situ- 
ation, but the two-point correlation function is assumed to 
be £(£) = (^/16) -1,8 over all the available dynamic range, 
with £ expressed in Mpc. The four solid curves correspond 
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to four volume limited catalogs E^l °f our SDSS-like sur- 
vey, of depths Ri = 200, 600, and 1000 and 1200 Mpc. The 
case Ri = 1200 Mpc is valid as well for the parent sample 
£. The discontinuities on the extremities of the curves are 
boundary effects related to the way we discretize the inte- 
gral equation (^3|). However, except for the left extremity 
of the dotted curve, they are likely to express the fact nu- 
merically that the optimal weight from finite volume effects 
is singular at the edges of the survey, at least for the right 
extremity of the dotted curve (see Appendix B.2). There are 
also some small irregularities on the solid curves, but these 
are random fluctuations due to the finite number of steps in 
the Monte-Carlo simulation used for computing the angular 
average ( p^ (see Appendixes B.l, B.2). Apart from these 
details, we see that the optimal weight is a smooth function, 
close to unity. Actually, taking uj — 1 gives almost the same 
value as the optimal weight for the finite volume error, at 
least for all the examples considered here. Thus a homo- 
geneous weight u) = 1 approximately minimizes the finite 
volume error in the space of radial weights u)(r). Rigorously, 
this result is not necessarily true for a catalog with a com- 
plicated geometry. However, it should be valid if the catalog 
is compact enough, which will be assumed in the following. 

As a result, we propose the following approximation for 
the optimal weight, in the general case: 

w(r)ocl/[A| + A| + A?,(r)] . (54) 

The coefficient of proportionality is determined by the nor- 
malization (^|). This approximation is quite natural, because 
it properly takes into account the relative weight of each 
contribution to the cosmic error. If one of them is dominant, 
then uj converges to the correct solution of the integral equa- 
tion ©, although only approximately if the finite volume 
error is dominant. 

Figure ^ shows the optimal (radial) weight obtained 
from the numerical solution of the integral equation ( |53| ) 
(solid curve on each panel). Each line of panels corresponds 
to a given value of k, which increases from top to bottom. 
Each column of panels corresponds to a fixed value of the 
scale £ (from left to right, £ = 0.1 Mpc, 1 Mpc and 10 Mpc). 
The dashes and dots display the case when the finite vol- 
ume error is negligible, and dominant, respectively. The long 
dashes show approximation They overlap surprisingly 

well with the solid curves. 

Note that at large scales, the optimal weight tends to 
unity, because discreteness effects become negligible, and 
edge effects dominant (e.g. SC). (The case of I = 100 Mpc is 
not shown, since it is quite similar to I = 10 Mpc). The de- 
parture of u) from unity on the other hand increases at higher 
order k, and smaller scales. Then discreteness effects tend 
to dominate the cosmic error (e.g. SC), and, as discussed 
in § 3.3, the weight strongly (exponentially) decreases with 
r. These arguments are partly illustrated by Figure ^. For 
each value of k, various contributions to the cosmic error 
calculated from equations (^|), (^]), (|5^) and (|5l|), using 
the optimal weight, are plotted as functions of scale. The 
solid, dotted-dashed, and long dashed-short dashed lines cor- 
respond respectively to the cosmic error, the finite volume 
error, and the edge effect plus shot noise contribution. 

In figure ^[ the cosmic error is displayed as a function 
of scale for all values of k. The solid lines correspond to the 
result given by the optimal weight and the dots to u = 1. 



There are also dotted-dashed lines almost perfectly match- 
ing the solid ones: they correspond to approximation (|54j). 
The degree of matching suggests that this is indeed an excel- 
lent approximation. Triangles, squares, hexagons and circles 
respectively correspond to k = 1, 2, 3 and 4: the cosmic error 
increases with the order k. According to the figure, u) — 1 
provides a satisfactory weighting scheme for our mock SDSS 
catalog on scales larger than ~ 1 Mpc. 

4.3 Volume limited subsamples versus full catalog 

According to the above analysis, the choice u = 1 approx- 
imately minimizes the finite volume error. As there is no 
selection effect in an homogeneous catalog, the discreteness 
error is minimized as well with u = 1; finally, so is the edge 
effect contribution (§ 3.3). This confirms the common wis- 
dom, that the optimal weight in an homogeneous catalog 
is uj ~ 1, be it volume limited or a two-dimensional galaxy 
catalog. 

In figure ^, the cosmic error on the factorial moments 
is displayed as a function of scale for our catalog £ and its 
volume limited subsamples £y L defined in end of § 4.1. From 
top to bottom, we have k = 1, 2, 3 and 4. The solid lines 
correspond to £ with optimal weight. The dots, dashes, long 
dashes, dot-dashes, dot-long dashes and long dashes-short 
dashes correspond respectively to Ri — 200, 400, 600, 800, 
1000 and 1200 Mpc. 

The figure illustrates clearly that a measurement with 
optimal weights using the full catalog yields smaller variance 
than any volume limited subsample (hereafter VLS). Small 
VLSs (with small depth) are denser than large VLSs (with 
large depth), so the shot noise error is more significant on 
the latter than on the former. The opposite is true for the 
finite volume error and the edge effect error. Large VLS are 
thus suited for probing large scales, while small VLS probe 
small scales, especially at high order k. According to this 
argument, it is possible to construct a VLS which is fine 
tuned for a particular scale. At this scale, the cosmic error 
is almost (but not quite) as small as the one obtained from 
the optimal weights from the full survey. However, this is 
not true for other scales, i.e. the dynamic range is quite nar- 
row. Therefore, a series of VLSs has to be constructed, each 
optimized for a different scale. As a result, the collection 
of VLSs can achieve almost as small errorbars as the op- 
timal measurement on the full catalog only at the expense 
of a lot more work. In summary, a single optimal measure- 
ment yields smaller errorbars more efficiently than a strategy 
based on a series of VLSs. 



5 SPARSE SAMPLING STRATEGIES 

So far we dealt with the problem of extracting information 
from existing catalogs in an optimal way. Another degree 
of freedom arises, during of the design of a survey. Next we 
will be concerned with the optimal design, especially with 
the optimal use of the available telescope time. This in turn 
inevitably leads to the issues of sparse sampling and optimal 
survey geometry. We discuss them as follows: 

(i) In the spirit of K86 (see also the recent work of Heav- 
ens & Taylor 1997), we explore the question: given a fixed 



© 1997 RAS, MNRAS 000, 



10 S. Colombi et al. 



1000 
100 

— 10 

ii 

* l 

3 0.1 
0.01 
0.001 
1000 
100 

CO 10 

II 

£L 1 

3 0.1 
0.01 
0.001 
1000 
100 

m 10 
x l 

3 0.1 
0.01 
0.001 
1000 
100 
10 

_ 1 

3 0.1 
0.01 
0.001 



| 1 1 1 1 | ! 1 1 1 | 1 | 

1=0.1 Mpc J 




p 1 ! 1 1 | 1 1 1 1 | 1 ; 

1 1=1 Mpc : 


, , , , 1 , , , , 1 , 




, , , , 1 , , , , 1 , 




500 



1000 



500 



1000 



500 



1000 



II 




500 1000 
R (Mpc) 



J i i i i L 

500 1000 

R (Mpc) 



500 1000 
R (Mpc) 



Figure 2. The optimal weight for measuring in our virtual SDSS like catalog (see § 4.1) is plotted as a function of the distance R 
from the observer (solid curves). Each panel corresponds to a choice of (k,£). The order k increases from top to bottom and the scale 
from left to right. The dashed curves are in the assumption that the finite volume error is negligible, while the reverse is true for the 
dotted curves. The long-dashed curves correspond to our proposed approximation (|54h for the optimal weight. 
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Figure 3. The cosmic error is shown as a function of scale, when 
the factorial moments F^ are measured with the optimal weight 
in our virtual SDSS like catalog £ (see § 4.1). Each panel corre- 
sponds to a value of the order k. The solid, dotted-dashed and 
long-dashed short-dashed curves correspond respectively to the 
total error, the finite volume, and the edge plus discreteness ef- 
fect contribution. 



0.1 1 10 100 

I (Mpc) 

Figure 4. The cosmic error for our virtual SDSS like catalog £ 
(see § 4.1) is shown as a function of scale, when the factorial mo- 
ments Ff; are measured with the optimal weight (solid curves) , ap- 
proximation ( p4j ) (dotted-dashed curves almost perfectly match- 
ing the continuous lines), and uniform weights (dots). The tri- 
angles, squares, hexagons and circles respectively correspond to 
k = 1, 2, 3 and 4. 

amount of telescope time, how to build a statistically opti- 
mal three-dimensional magnitude limited catalog, if one has 
the freedom to sample randomly a fraction / < 1 of the 
visible galaxies? As mentioned in the introduction, a small 
sampling rate / allows the construction of a deep but sparse 
survey. This results in small finite volume and edge errors, 
but large discreteness effects. The reverse is true when / 
is large. The best compromise between these requirements 
yields the optimal sampling rate, which depends on the scale 
considered and on the statistic. Here, we extend the calcu- 
lations of K86 using a more accurate estimate of the cos- 
mic error. While originally only the two-point function was 
considered, the optimal sampling rate will be calculated for 
higher order factorial moments, Fk, k < 4. In § 5.1, to sim- 
plify the analysis, the survey is assumed to have full sky cov- 
erage. The conclusions, however, do not depend significantly 
on this assumption. We also suppose that the redshifts are 
collected individually. In § 5.2, the changes brought by mul- 
tifiber spectroscopy are discussed, as today this is the most 
widespread method for collecting redshifts. 

(ii) Following K96, § 5.3 considers the question of optimal 
survey geometry. A survey is assumed to cover a fraction of 
the sky with redshifts collected by multifiber spectrographs. 
Thus the catalog can be naturally decomposed into small 
patches corresponding to the field of view of the telescope. 
The design goal is the optimal arrangement of these patches. 
Some of the choices are compact, elongated (or VLA like), or 
quasi randomly spread over the sky. This issue is immensely 
complicated by several non-linear factors and details of the 
actual parameters of the proposed survey. Accordingly, we 
only attempt to illustrate the problem, and give an approx- 
imate solution under generic circumstances. 

5.1 Full Sky Survey 

Given the luminosity function of equation (^), and assum- 
ing, as in K86, that the time required for measuring the 
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Figure 5. The cosmic error is displayed as a function of scale 
for our virtual SDSS like catalog £ and various volume-limited 
subsamples £y L (see § 4.1), as expected for factorial moments 
Fk measured with the optimal weight. Each panel corresponds 
to a value of k, increasing from top to bottom. The solid curves 
correspond to the parent sample £. The dots, short dashes, long 
dashes, dot-dashes, dots-long dashes, short dashes-long dashes 
correspond respectively to the subsamples of depth Ri = 200, 
400, 600, 800, 1000 and 1200 Mpc. Note, that for R t = 200 Mpc, 
there is a point missing for i = 100 Mpc, as a sphere of such 
radius is too large to be included in the subsamplc. 



redshift of a galaxy is inversely proportional to the luminos- 
ity L, the total telescope time required for constructing a 
magnitude limited catalog is 

Ttotaioc / r 4 dr^T[a,Lu m {r)/L,], (55) 
Jo 

where i? max is the depth of the survey, and the redshifts are 
assumed to be collected individually. The magnitude limit 
Mii m of the catalog is related to its depth i? max through 

Mite, = 5 log 10 (^y) + Afrrf, (56) 

where R TC { and M re f are constants. This implies that 

Ttotai <x 0*-R max , (57) 

in agreement with K86. Thus sampling a fraction / of the 
galaxies for a fixed telescope time results in 

i?max = J R rcf /" 1/5 . (58) 

Given a scale I and a value of the order k, the optimal sam- 
pling rate / by definition minimizes the cosmic error of the 
factorial moment Fk(£). Before any analytical estimates of 
the optimal sampling rate, let us consider an example: a 
full-sky survey S(f), with luminosity function and statistics 
identical to § 4.1. We choose 

M Toi = 15.5, R Iet = 391 Mpc. (59) 

For / = 1 this is roughly a full sky CfA2 catalog, although 
actually denser (see, e.g. de Lapparent, Geller & Huchra 
1989). On average it contains 

iV obj (/ = 1) = iV rcf ~ 72400 (60) 
galaxies. 

For this hypothetical survey, the optimal sampling rate 
was found numerically by calculating the cosmic error as 
explained in detail in Appendix B.3. Figure ^ shows the re- 
sults for Fk, 1 < fe < 4, as a function of scale. The symbols 
(respectively triangles, squares, pentagons and hexagons for 
k = 1, 2, 3 and 4) take into account effects of the selec- 
tion function by using the approximation ( p4| ) for the opti- 
mal weight uj. The curves (respectively dots, short dashes, 
long dashes, dot-dashes for k = 1, 2, 3 and 4) have an uni- 
form selection function: they correspond to a homogeneous 
sample of same size -R ma x(/) and involving the same num- 
ber of objects N bj(f) as S(f). Apart from the small shift 
for k — 1, the curves superpose quite well to the symbols, 
showing that such an approximation is valid. This reduces 
significantly the complexity of the calculation of the opti- 
mal sampling rate. Note, however, that our choice of R la i 
[eq. ([si))] was not arbitrary: i? rc f was chosen to be twice the 
radius of the volume-limited subsample which contains the 
largest number of objects (for / = 1). The following scaling 
is thus true 

i? rof 10" 02Mlcf = constant = C ref . (61) 

With this choice of i? rc f, the sphere of radius 7? max (/) 
includes most of the detectable galaxies. With a larger 
(smaller) value of R TC i than given by equation (|5^), the 
curves on figure [] would be shifted upwards (downwards). 

Figure [] shows the cosmic error on the measured facto- 
rial moments as a function of the sampling rate / for various 
scales I = 0.1, 1, 10 and 100 Mpc. The symbols are the same 
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Figure 6. The optimal sampling rate / for measuring the fac- 
torial moment of order k is shown as a function of scale for our 
virtual survey S(f) (see text). The symbols use a realistic selec- 
tion function, whereas the curves assume uniform selection. The 
triangles (dots), squares (short dashes), pentagons (long dashes) 
and hexagons (dot-dashes) correspond to k = 1, 2, 3, and 4, re- 
spectively. 



as in figure |^. Although it is an excellent approximation for 
determining the optimal sampling rate, a uniform selection 
is inaccurate for estimating the cosmic error in general, ex- 
cept to some extent for k < 2, and for large scales otherwise. 

From figure the optimal sampling rate / exhibits a re- 
markable power-law behavior up to the saturation to unity, 
except for k = 1, when it rapidly converges to a value smaller 
than unity at small scales. Moreover, / increases with the 
order k, corresponding to the increasing relative contribu- 
tion of the discreteness error with k. These features can be 
further explained by analytical calculations as follows. 

According to the previous findings, calculations will be 
simplified by assuming a uniform selection. This appears to 
be good approximation for determining /. The analytical 
formulae of SC yield the relative cosmic error on the mea- 
surement of Fk, k < 3. This can be used to find the optimal 
sampling rate in the weakly and highly non-linear regimes. 
The details of the calculations can be found in appendix C. 

In the highly nonlinear regime, £ 3> 1, i.e. at small 
scales, edge effects are expected to be negligible compared 
to finite volume effects (e.g. SC). The optimal sampling rate 
yields the best compromise between discreteness and finite 
volume effects. If R rc f is sufficiently large, £(R max ) is ex- 
pected to exhibit an approximate power-law behavior. For 
a Harrison- Zeldovich power-spectrum (|5fc| 2 ) tx fc, 
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(62) 




The power-law behavior of the average correlation function 
in the nonlinear regime is a good approximation [eq. (jjj])], 
and, although not absolutely necessary, simplifies the com- 
putations: 

= (£/e y, i <, to. (63) 



Figure 7. The cosmic error on the measured factorial moment 
of order k is displayed, as a function of the sampling rate / for 
our virtual survey S(f). Each panel corresponds to a given choice 
of scale I = 0.1, 1, 10 and 100 Mpc from top to bottom. As in 
figure ^, the symbols use a realistic selection function whereas the 
curves correspond to uniform selection. 
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With the above hypotheses, the optimal weight, when not 
saturated to unity, is 



/ 



5k 



TIL «7(*:-l) 



5(3--y)(fc-l) 
^ 5fc + Tl/ -3 



£ > 1. (64) 

In this expression, the quantities a° and are numbers 
depending on the order k and on Qi, I < 2k. Their ratio 
writes, for k < 3, 



„D D 



1 

8Q? 



a? 



Q 3 



47.4Q 6 



(65) 



*1 "2 "3 

Equation (^i|) is not valid for the Gaussian case, except for 
k — 1. Then / is independent of scale when £ ^> 1, ex- 
plaining the result obtained for the triangles and the dots in 
figure H. 

In the weakly nonlinear regime, £ <C 1, i.e. at large 
scales, edge effects typically dominate over finite volume ef- 
fects (e.g. SC). Thus the optimal sampling rate / results 
from a competition between edge effects and discreteness 
effects (see Appendix C for the details). At large scales, £ is 
not a power-law, (except for very large scales, I J; 100 Mpc), 
neither is the optimal sampling rate 



2k 2 Ri 



(66) 



3a£ iV ref £P : 
where k 2 /af is slowly increasing with k: 

"4^0.23, -J^O.26. (67) 

"1 ut 2 "3 

For 7 < 3 the optimal weight is a decreasing function 
of scale. It depends on the order k considered, and, for non- 
linear scales, on the details of the higher order statistics 
through the ratio aj?/af . However, it has a weak depen- 
dence on the total telescope time Ttotai according to the 
following argument: Ttotai oc R^ a{j N TC { tx R^ cf , therefore in 
the highly nonlinear regime [eq. (p4h], 

/ocT,*^, (68) 

constituting a weak dependence of the optimal sampling rate 
on the total observing time, at least for k > 2. In the weakly 
nonlinear regime, ^< 1, / does not depend at all on T tota i. 
Consequently, the results displayed in figure ^ are more gen- 
eral than initially suspected: they should be roughly valid 
for any full sky survey of depth larger than a few hundred 
Mpc (so that £(-Rmax) is approximately a power-law), to the 
extent that the assumptions we have made for the underly- 
ing statistical properties (see § 4.1) are realistic. 

To give orders of magnitude, let us compute numerically 
what would be typically the optimal sampling rate from for- 
mulae ( p4] ) and ([i6]). Assuming Ho = 50 km/s/Mpc, a rea- 
sonable choice for the correlation length in equation ( p^ ) is 
Lo — 50 Mpc. The same values are taken as in equations 
@ and @ for R Ict and N rc{ . With 7 = 1 A io = 16 Mpc 
and the values of Qn quoted in equation (|44|), from equa- 
tion 
each va 



4) the optimal weight fu is obtained corresponding to 
ue of k in the highly nonlinear regime: 



h ~ 0.05, 

h ~ min(l,0.2^- a55 ) 

f 3 ~ min(l,0.3^" a75 ) 



(69) 
(70) 
(71) 



Setting £ = 10 Mpc in equation ( |70[ ) gives j% ~ 1/18, in 
rough agreement with figure ^. This result is similar to the 
findings of K86. This is not surprising, although his calcu- 
lation was done for rather larger scales I ~ 30 Mpc. In the 
weakly nonlinear regime, equation ( |66| ) must be used. For a 
CDM spectrum normalized to COBE (£(16 Mpc) ~ 1.22 2 , 
see, e.g. Bunn & White (1997)) and I = 100 Mpc, f 2 ~ 1/135 
would be optimal: a quite small sampling rate, in rough 
agreement with figure ^| 

More importantly, however, the sampling rate has to 
be optimized for range of scales and statistics, not only 
for fixed values of k and I. The optimal sampling rate for 
(k,l) — (2, 100), / ~ 1/130, is not optimal for other scales 
and orders: according to figure ^ (symbols), it dramatically 
penalizes small scales, especially when k is large: for k = 4 
and £ — 1 Mpc, the error is about one order of magnitude 
larger than for / = 1. Conversely, / ~ 1 is a good choice for 
small scales, however it would increase the errors on large 
scales by a factor of 2 ~ 3. A good compromise seems to be 
/ of order 1/10, in approximate agreement with the initial 
findings / = 1/20 of K86. This choice unfortunately only 
optimizes for low-order statistics, k < 4. The higher is k, 
the higher should be /. In particular, if one wants to ana- 
lyze the probability distribution function of the density field 
in terms of shape (e.g. Bouchet et al. 1993, CBSII), perform 
cluster selection (Szapudi & Szalay 1993a, Szapudi & Sza- 
lay 1996), or any other investigation which depends on the 
full hierarchy of the factorial moments, a sparse sampling 
strategy seems inappropriate (as already stated in CBSII). 
In summary, although for low order moments a sampling 
rate / < 1 can be found, which globally optimizes the mea- 
surement, higher order moments require full sampling. This 
conclusion is further supported by the arguments of the next 
section, where it is shown that, with the advent of multifiber 
spectroscopy, there is more to be lost by undersampling than 
any possible gain. 

Previously, we assumed that the telescope time t b s re- 
quired to measure the redshift of a galaxy of luminosity L 
was proportional to 1/L. This assumption probably breaks 
down for faint objects when the background noise domi- 
nates. Then the scaling t h B tx 1/L 2 , as used by Heavens & 
Taylor (1997), would probably be more appropriate. Using 
this scaling does not significantly change the previous results 
and conclusions. The greatest difference is at large scales, 
where the optimal sampling rate becomes at most twice as 
small compared to figure [], where i bs oc 1/L was assumed^]. 
Note also that when t b s oc 1/L 2 , assuming uniform selec- 
tion is not seen to be a good approximation anymore, thus 
analytical estimates similar to equations (^34|) and (^6|) arc 
quite inaccurate. 



5.2 Multifiber spectroscopy 

The above calculation was assuming that the redshifts are 
collected individually. In today's astronomy, the problem is 
complicated by the fact that multifiber spectrographs are 
used to collect redshifts. With this technique it is possible 
to measure simultaneously iVs redshifts in a patch of the 



t This comparison supposes that the optimal sampling rate does 
not depend on total telescope time. 
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sky of fixed angular size fis- This limits the validity of the 
previous calculations to the case when the typical number 
of candidates in such a field, iVcan oc f 2 ^ 5 , is larger than TVs, 
as already discussed in K86j§]. 

Even if the available telescope time is too short to allow 
iVcan Ns , a sparse sampling strategy still can make sense, 
ft is possible to spend more time per field in order to go 
deeper, and thus allow a smaller value of /. There is a price 
to pay for such strategy: if T to tai is fixed, the total solid angle 
covered by the survey will be reduced. More details of this 
argument can be found in K86. Here let us only remind the 
scaling of the relevant quantities with the sampling rate, 
such as the depth i? max of the catalog, its solid angle Q and 
its volume V = f2i?^ ax /3: 



Rrcff 



-1/3 



2/3 



V = Kef/ 



-1/3 



(72) 



Similarly to K86, we are going to see how these new rela- 
tions affect the optimal sampling rate, compared to § 5.1. 
Just as before, uniform selection will be assumed, because it 
was shown that the details of the selection function do not 
influence the results. Unfortunately, the varying shape of 
the catalog renders the calculation of the finite volume error 
costly: for each value of /, Q changes, and a new calcula- 
tion of the two-point correlation matrix £(ri, r%) is required 
[eq. (p2])]. This would go beyond the scope of this paper. 
Instead the finite volume error is simply calculated for a 
spherical catalog of same volume V. This is a reasonable 
approximation if the catalog is compact enough, although it 
would obviously fail for the extreme case of a pencil beam 
survey. We shall partly come back to that problem in § 5.3. 

Similarly to § 5.2, and after the calculations detailed in 
Appendix C, the result in the highly nonlinear regime is 



9k-3a»N-jLl-+ 3 < k -V 



1L 

?» l, 



fit 



tn( k - 
■o 



9k + 7i-3 



9(3-f )(fc-l) 
9k + y L -3 



where the typical size L ro f of the catalog is defined as 
1/ - 47r r3 

Vref = "g-^ref- 



(73) 



(74) 



In the weakly nonlinear regime, £ <C 1, the edge effects are 
likely to dominate over finite volume effects. To leading order 
in v/V the solution is insensitive to the shape of the catalog, 
although as shown in § 5.3, this is not true in extreme cases. 
The optimal sampling rate is 



/ 



2k 2 L 



3 

ref 



of £P 



(75) 



Equations (^) and (jTH]) are quite similar to eqs. (^) and 
( pij ) in § 5.1. For example, in the weakly nonlinear regime, 
the optimal sampling rate is roughly three times larger for 
multifiber spectroscopy, than in § 5.1 (instead of the factor 
two of K86). In the highly nonlinear regime, the difference is 



§ By taking into account the galaxy density fluctuations from 
patch to patch over the sky, a more realistic constraint would be 
an + ^(^s)] *2 where K;(fig) is the average 
of the angular correlation function over the patch. This condi- 
tion accounts for the expected Poisson fluctuations and intrinsic 
correlations of the density field, up to second order. 



apparently smaller, although this is probably partly due to 
our way of estimating the finite volume error, and changes 
slowly with scale. For multifiber spectroscopy, / is typically 
twice than for individual collection of redshifts for k = 1, 
30% more for k — 2 and 20% more for k — 3. In the weakly 
nonlinear regime, our estimate of the cosmic error is accu- 
rate enough to compute the gain on the errors for optimal 
compared to full sampling, even though uniform selection 
was used (see lower-panel of Fig. For a CDM spectrum 
normalized to COBE at £ = 100 Mpc with the correspond- 
ing optimal sampling rate fi ~ 1/45, the gain is merely 
1.5 in reduction of AF2/F2. This is to be compared with 
2.8 for individual redshifts. These results qualitatively agree 
with K86: the optimal sampling rate is increased with multi- 
ple collection of redshifts compared to individual collection, 
and the corresponding gain in the cosmic error is smaller. 
This strengthens the conclusions of § 5.1: even if a sparse 
sampling strategy could be optimal for special applications, 
such as measuring low-order statistics, the resulting gain is 
too small compared to the corresponding increase of errors 
for high order statistics which are more sensitive to sam- 
pling. Another important point to note is that sparse sam- 
pling strategy consists of measuring deeper redshifts, thus 
less controllable systematic errors are also likely to increase, 
which are not included in the previous discussions. 



5.3 Survey geometry 

So far we neglected the dependence of the cosmic error and 
the corresponding optimal sampling rate / on catalog geom- 
etry: this amounts to replacing the geometry with a sphere of 
same volume. For a deep catalog covering only a small part 
of the sky, such as a pencil beam survey, this is clearly not a 
good approximation. The finite volume error depends signif- 
icantly on the catalog geometry. So does the shot noise er- 
rorpj, and particularly the edge effect error, especially when 
the cell size becomes comparable to the size of the largest 
cell contained in the catalog. This did not show up in our 
calculations to leading order in v/V: in that case, the edge 
effect and the shot noise errors do not depend on the catalog 
geometry. 

Since geometry is another degree of freedom in the de- 
sign of surveys, it can be used to achieve predetermined sta- 
tistical goals. For typical redshift surveys, the geometry has 
an "atomic" building block, Qs, corresponding to the field 
of view of the telescope. Within this, a certain number of 
spectra can be collected. The design problem which will be 
discussed in this section is (K96) : what is the optimal distri- 
bution of fields of size Sis on the sky? Note that this problem 
is relevant even to redshift surveys, which, as the SDSS, plan 
to uniformly cover a large portion of the sky. Since such a 
survey takes a long time to carry out, it is important that 
in the initial phases the individual fields should be placed in 
such a manner that as much preliminary information as pos- 
sible could be extracted. Such strategy can not only provide 
preliminary results before the full completion of the survey, 
but it gives an excellent early check, whether the envisioned 
goals of the survey can be achieved when completed. Note 



" through "edge-discreteness" effects as mentioned in the 
introduction. 
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that in such a case, the design problem acquires another di- 
mension, i.e. time, since analysis can be performed at several 
stages before full completion. Generalization of the follow- 
ing arguments for this is trivial. Some of the possibilities 
for the geometry are a compact structure, an elongated one 
(perhaps VLA-like), or a (quasi) random distribution, etc. 
Judging all possibilities is extremely difficult, not to men- 
tion that for realistic surveys other "worldly" factors play 
important roles, such as weather, dark time, season, etc., 
therefore the aim is here only to illustrate the problem and 
present some practical suggestions by solving it under sim- 
plified circumstances. 

Each field corresponds to an elementary galaxy catalog 
of volume Vs = f2si?max/3 at position hi in the sky. The 
centrally important quantity for the finite volume error, the 
integral of the two-point function over the whole survey can 
be decomposed as 

m-jQ^m+jtfii*). (76) 

In this equation, Ms is the total number of fields. The an- 
gular correlation function w(8) is defined by 

w(8) = ^J d 3 ri J d 3 r 2 £(r 12 ), (77) 

where each integral is performed on disjoint elementary 
catalogs with positions hi and hi on the sky such that 
9 = (711,712). The quantity £(£s) is the average of £ over 
an elementary catalog: 

f(L S ) = I cfVirfVa^m). (78) 
J Vs 

According to equation (^6|), values of 8ij could be chosen, 
which minimize £(L) and therefore the finite volume error. 
This choice would not affect the edge effect error and the 
shot noise error significantly: they do not depend on the ge- 
ometry of the catalog to leading order in v/V , i.e. at small 
scales. Although the solution depends on the large scale be- 
havior of the two-point function, we conjecture the best con- 
figuration to be a "glass" spread over the largest possible 
area of the sky. This configuration would maximize the dis- 
tance between the fields, and, in agreement with intuition, 
it would decrease the coherence of the fields as much as 
possible. Clearly, the "compact" configuration is the worst, 
followed by the "line" configuration. These conclusions are 
very similar to K96, who discussed optimal survey strate- 
gies for measuring weak lensing related two-point statistics 
in Fourier space. 

However, the above arguments are relevant only if the 
considered scale i is small compared to the size ^ max of the 
largest cell included in the catalog. When £ becomes com- 
parable to < mss , our calculations of the full cosmic error are 
not valid anymore, although the conclusion remain approx- 
imately valid for the finite volume error. The exact calcula- 
tion of the cosmic error in the regime where I S5 ^ max is quite 
tedious (see appendix B of SC for an example). Qualitative 
description, however, can be given. The shot noise error and 
the edge effect error increase with v/V, where V is the vol- 
ume occupied by positions of cells included in the catalog. 
This ratio increases toward larger scales both because v is 
increasing and V decreasing. This latter effect is obviously 
more prominent for "glass" , or "line" configurations. The 



edge effect error (and the shot noise error) is therefore more 
important at large scales for these configurations, than for 
the compact geometry. 

As before, the choice of the optimal geometry of the 
catalog results from the competition between various con- 
tributions to the cosmic error, namely the finite volume er- 
ror against the edge effect and the shot noise errors. The 
details of the sampling strategy depend on the value of f2s, 
on total telescope time, the needed dynamic range in scales, 
the statistical aim, and the clustering properties of the uni- 
verse. A good compromise could be a survey composed of 
several compact subsamples of size Of > fls, spread over a 
large fraction of the sky on a glass like structure. The op- 
timal value of Qf is a complicated function of the various 
parameters of the survey. While the details of this difficult 
issue are left for subsequent research, it is clear that the 
above arguments give an approximate solution once the sur- 
vey parameters and the scale range are fixed. 



6 DISCUSSION 

In this article, we examined extensively the measurement 
of low-order moments of the count probability distribution 
function (CPDF) in three-dimensional magnitude limited 
galaxy catalogs, with special emphasis on issues related to 
the effects of the selection function. A new estimator was 
proposed: the weighted factorial moment of count-in-cells, 
corrected for selection effects [eq. (g)]. The following ques- 
tions were studied in detail: 

(i) Given a catalog, what is the optimal way of measuring 
factorial moments? 

(ii) What is the optimal sampling strategy for construct- 
ing a catalog to measure factorial moments? 

Both of these question are intimately related to the variance 
of the proposed unbiased estimator. Thus the cosmic error 
was computed, by extending the calculations of Szapudi & 
Colombi (1996, SC) for the new estimator which includes a 
local statistical weight, and for a general selection function. 
Similarly to SC, the hierarchical model and local Poisson be- 
havior was used to simplify the calculations. The local (but 
not the global) variations of the selection function were ne- 
glected as well. This allowed for the first time the accurate 
estimation of the optimal weight, i.e. the one which mini- 
mizes the cosmic error [eqs. ( p7j ) to ([jlj)]. 

To illustrate numerically the first question (i) , a virtual 
SDSS-like catalog was considered. For this, we demonstrated 
the advantages of our new estimator, which extracts all the 
relevant information from the catalog at once. This method 
not only yields higher accuracy in a wider dynamic range 
than the more traditional volume limited method, but it is 
significantly more efficient as well. As an added benefit, our 
calculation of the cosmic error finds the best volume limited 
strategy if other reasons necessitate its use. A remarkably 
simple expression for the optimal weight uo was found, which 
provides an excellent approximation to the solution of the 
corresponding integral equation 

w(r)ocl/[A!+AS.+A?,(r)] , (79) 

where A P , A| and Ap(r) are respectively the finite volume, 
the edge effect and the discreteness errors. As a result, the 
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optimal weight for an homogeneous sample, i.e. with uniform 
selection function, is very well approximated by uj = 1. Un- 
like TV-point correlation functions, it is impossible to correct 
for edge effects with our estimator, due to the finite exten- 
sion of the smoothing kernel. A different approach, however, 
is presented elsewhere (Szapudi & Szalay 1997b). 

Interestingly, it appears that u = 1 is a good approxi- 
mation for the optimal weight in the SDSS catalog, at scales 
larger than ~ 1 Mpc. As illustrated by figure 0, the corre- 
sponding cosmic error on the estimates of (p" should be 
rather small, of order 1 — 2% for k — 2, 3 — 5% for k — 3 
and 5 — 10% for k = 4. These results, however, depend on 
the details of the model we used for clustering properties of 
the universe. 

Note the similarity of our results with those of Feld- 
man, Kaiser & Peacock (1994, hereafter FKP), who did sim- 
ilar calculations for finding the optimal weight to measure 
the power-spectrum P(k) = (\8k\ 2 ), where 6k is the Fourier 
transform of the density contrast S — p — 1. They assume 
that the modes are normally distributed in Fourier space 
and consider modes corresponding to scales small compared 
to the size of the catalog. Their result is 



oj{r) oc 



l/n(r) + P(fc) : 



(80) 



in excellent agreement with equation (|79[). Indeed, from 
equation (|8o|), the optimal weight presents a plateau at small 
r and decreases exponentially at large r. The same conclu- 
sions apply as well if we compare our optimal weight to the 
one obtained for measuring optimally the two-point correla- 
tion function £(r) (e.g. Efstathiou 1996; Hamilton 1997a). 

To address question (ii), we considered the sparse sam- 
pling strategy proposed by Kaiser (1986, K86, see also Heav- 
ens & Taylor 1997). This consist of randomly sampling a 
fraction / of the visible candidates to build a three dimen- 
sional catalog. The optimal sampling rate / minimizes the 
cosmic error, when the total available telescope time is fixed. 
A small sampling rate / results in a sparse but deep catalog, 
therefore decreasing the finite volume and the edge effects. 
Then discreteness error becomes the limiting factor. The 
reverse is true for high sampling rate. To measure factorial 
moments up to order four in a reasonable range of scales such 
as 1 Mpc <; I <; 100 Mpc, / = 1/10-1/3 yields a good com- 
promise between the above effects, depending whether the 
redshifts are collected individually, or, as usual today, collec- 
tively. This is in qualitative agreement with K86, although 
slightly larger for the following reasons: our calculation of 
the cosmic error is more realistic; we consider higher order (4 
vs. 2) statistics than K86; we emphasize dynamic range from 
small scales to large scales, the latter being solely considered 
by K86. Note that the optimal sampling rate increases with 
the order k of the statistic considered and decreases with 
scalejjj]. Therefore, when properties sensitive to higher than 
fourth order are considered, such as the CPDF shape in the 
nonlinear regime, cluster selection, higher, most likely full 
sampling will be optimal (Colombi et al. 1995, Szapudi & 



II Except for very large scale £ a few hundred Mpc. In this 
regime, one expects £ oc l~ 4 [eq. (p3)] implying f oc £ [eqs. (pq), 
j75p], in agreement with the calculations of Heavens & Taylor 
(1997) concerning the power-spectrum. 



Szalay 1996) , all the more since we have shown that a sparse 
sampling strategy yields only a marginal gain compared to 
full sampling even at the largest scales. 

Finally, the design of optimal geometry for a galaxy sur- 
vey covering some fraction of the sky (e.g. Kaiser 1996, K96) 
was considered. This is an important problem even for sur- 
veys eventually covering a large, continuous portion of the 
sky, since early results can be obtained by the appropriate 
(evolving) geometry. Again, the design is governed by the 
competition between edge and discreteness effects, requir- 
ing compactness at large scales, and finite volume effects, 
requiring large sky coverage. For a reasonably deep survey, 
where the finite volume error is expected to be quite small, 
the compact geometry is probably the best choice, because it 
allows the largest possible scale range for the measurement 
of the moments. A glass like configuration, where the survey 
is spread out over the sky in pencils beam subsamples, would 
reduce the finite volume error. It would be, however, subop- 
timal for edge effects at large scales, thus constraining the 
dynamic range of the measurement. A possible compromise 
is to increase the size of each subsample, for compactness, 
and still spread them over the sky in a quasi random glass 
like structure, for reducing the finite volume effects. The de- 
tails of such a construction, however, depend so much on the 
goals of the survey in terms of scale range, desired statistics, 
and predetermined conditions as well, such as the field of 
view of the telescope, the number of fibers of the multifiber 
spectrograph, and finally on the statistics of clustering in the 
universe, that we only attempted to illustrate the problem 
by solving it for the conceptually simplest cases. 

While the calculation presented in this work provides 
sufficient details for most practical applications, there are 
several points where generalization or extension could pro- 
vide more accuracy, if needed: 

(i) We did not investigate the dependence of the results 
on the details of the clustering in the universe, since a partic- 
ular model was assumed to illustrate our method (see § 4.1). 
As shown by SC, the cosmic error on statistics of order k 
depends on statistics of order I < 2k. The dependence be- 
comes stronger at small scales, possibly altering the optimal 
weight u). At very large scales, where the non-Gaussianity 
is less important, the results are only weakly sensitive to 
higher order statistics. At the level of approximation used 
in this paper, however, our qualitative results both on opti- 
mal weight and sampling rate are expected to be valid for 
a broad spectrum of realistic statistics. Extended perturba- 
tion theory (Colombi et al. 1997) can be used to estimate 
the sensitivity of the results within reasonable limits. An 
overall amplification for the optimal sampling rate of a fac- 
tor 4 is predicted for the variance at nonlinear scales when 
one passes from an effective spectral index n e fi = —9 to 
n e « = —1 (corresponding respectively to scale-free initial 



conditions (\S k \) oc feu near with Tin, 



-2 and n\ ln 



= 0). 



The calculations in this paper correspond to an intermedi- 
ate value n e g ~ —3 (corresponding to nii ncar = — 1), which 
is most supported by observations. 

(ii) So far, we employed the hierarchical tree model as 
an approximation to simplify the calculations of the cos- 
mic error. Another possibility is to use tree-level perturba- 
tion theory predictions (see, e.g. Bernardeau 1994, 1996), 
which are seen from TV-body simulations to be valid in the 
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regime £ 1 (e.g. Juszkiewicz, Bouchet & Colombi 1993; 
Bernardeau 1994; Baugh et al. 1995; Gaztahaga & Baugh 
1995; Colombi et al. 1996, 1997). The use of perturbation 
theory predictions for computing the cosmic error will be 
presented elsewhere (Szapudi, Bernardeau & Colombi, 1997, 
hereafter SBC). 

(iii) As mentioned in the introduction, redshift distortion 
was completely disregarded, although it affects the measured 
statistics in two different ways, depending on the scaling 
regime considered. At large scales, coherent flows enhance 
the density contrast along the line of sight, therefore in- 
creasing the amplitude of the iV-point correlation functions. 
At small scales, on the contrary, the "finger of god" effect 
by the high velocity dispersion of large clusters tends to 
smear out clustering and reduce the amplitude of the N- 
point correlation functions. The net effect on the parame- 
ters Qn is a decrease at small scales and no change at large 
scales (e.g. Matsubara & Suto 1994; Hivon et al. 1995). One 
consequence is that, at variance with what is expected from 
real space measurements in iV-body simulations, function 
Qn(£) is fairly flat in redshift space. As a result, the hier- 
archical model used along this paper appears to be an even 
better approximation for estimating the cosmic error in red- 
shift surveys. However, a correction for redshift distortions 
would be needed to transform the measured moments into 
configuration space, which is left for future work. 

(iv) Possible dependence of the clustering on luminosity, 
and morphology was neglected as well. If clustering of galax- 
ies increased with their luminosity (as some observations 
might indicate), magnitude limited surveys would have in- 
homogeneous clustering properties. Indeed, only bright ob- 
jects are seen deep in the catalog, thus clustering would 
change with depth implying a systematically increasing bias 
with distance from the observer. In that case, our estimator 
[eq. (^|)] would have to be corrected for this bias, otherwise 
the volume limited approach is the only alternative. In fact, 
if the bias is unknown a priori, the volume limited approach 
provides a way to estimate it. Using an already existing for- 
malism (e.g. Bernardeau & Schaeffer 1992), one can proba- 
bly directly include the effects of biasing in the calculation 
of the cosmic error, but this is left for future work. A similar 
effect is caused by the change of clustering with 2 in a very 
deep catalog (Suto 1997). 

(v) It is worth to emphasize again, that our purely statis- 
tical approach cannot account for systematic observational 
errors, introduced by the instruments, uneven sky, seeing, 
emission from our own galaxy, inappropriate K-corrections, 
problems with star galaxy separation, inaccuracy of red- 
shifts, etc. It is only hope, that in a well controlled sample, 
these effects do not constitute the dominant source of er- 
ror, or, turning it around, this condition yields well defined 
accuracy goals for observations. 

(vi) There are known alternatives over constructing an 
estimator F k from the CPDF. For instance, for many ap- 
plications, the connected moments, or cumulants Sn (or, 
equivalently Qn = Sn /N N ~ 2 ) are desirable. These numbers 
are ratios of combinations of factorial moments (for example 
Ss, = {5 Z )/{5 2 ) 2 ). Kim & Strauss (1997) proposed an alter- 
nate method to measure Sn in galaxy catalogs, by fitting 
the Edgeworth expansion (e.g. Juszkiewicz et al. 1995) con- 
volved with a Poissonian to the measured CPDF. They find 
that this method, less sensitive to the tails of the CPDF, 



is more robust than the traditional moment method which 
was refined in this paper. However, their error estimation is 
quite ad-hoc, even if normalized with iV-body simulations. 
Moreover, their method is valid only in the weakly nonlinear 
regime £ 1, and, in its present form, can be applied only to 
volume limited catalogs, in contrast with our indicator F k . 
It would be interesting to apply the formalism developped in 
this paper directly to the cumulants Sn and see how the re- 
sults differ from what we obtained for the factorial moments 
(SBC). 

(vii) The CPDF depends on the full hierarchy of facto- 
rial moments. In a finite galaxy catalog, however, only a 
limited amount of information is available, due to the cos- 
mic error and other sources of noise. Thus data compression 
must be possible without loosing significant information. For 
example, information content in the tails of the measured 
CPDF is small because of large fluctuations associated to 
rare events (e.g. Colombi et al. 1994). Data compression is 
already used for applying maximum likelihood tests to the 
galaxy distribution and the cosmic microwave background 
(e.g. Bond 1995; Vogeley & Szalay 1996; Tegmark, Taylor 
& Heavens 1996; Jaffe, Knox & Bond 1997). For counts- 
in-cells statistics, such a compression is complicated by the 
non-Gaussanity of the likelihood function at small and in- 
termediate scales (e.g. SC). 

(viii) A problem somewhat related to (vi), the estimators 
Fk are clearly not statistically independent (i.e. (F k F k i) ^ 
F k F k , if k # k', (F k F k ,F k „) F k F k ,F k „, k ± k' / k", etc.), 
therefore they do not provide independent tests of theoreti- 
cal models. As a result, the question of determining the best 
higher order statistics estimators beyond the two point cor- 
relation function (or its Fourier transform, the power spec- 
trum) remains opened. Ideally, one would like to create hier- 
archies of statistically independent estimators. Technics to 
build such hierarchies are already extensively developped for 
estimating two-point statistics in the galaxy distribution and 
in the cosmic microwave background (e.g. Hamilton 1997b; 
Knox, Bond & Jaffe 1997; Tegmark & Hamilton 1997). In 
our case, a first step would consist in finding estimators A k 
for which the covariance matrix (A k A k i) — A k A k is zero 
(SBC). Even if this is probably feasable, since the calcu- 
lation of the components of this matrix is similar to what 
we did for the cosmic error, this constraint is not sufficient, 
again because of the non-Gaussian nature of the underly- 
ing statistics. Unfortunately, the calculation of higher order 
covariance matrixes such as (A k A k i A k n) — A k A k i A k n looks 
quite tedious. 

This paper constitutes the second step in a major inves- 
tigation on the theoretical errors on counts in cells. SC laid 
the groundwork for all subsequent calculations, and here, 
their formalism was extended and applied for magnitude 
limited redshift surveys with realistic selection function. We 
proposed a new set of estimators for the factorial moments, 
which includes compensation for the effects of the selection 
function and a minimum variance weighting. The integral 
equation for this weight was solved, an excellent approxima- 
tion found, and the corresponding errors were calculated. 
Optimal sparse sampling strategies were considered as well, 
and it was shown that in most cases the decrease in variance 
does not outweigh the disadvantages of the corresponding 
information loss. Therefore full sampling is advocated for 
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most applications requiring high order statistics. Finally, the 
question of optimal survey geometry was addressed, and we 
found that quasi random distribution of fields is a reason- 
able choice, when a small fraction of the sky is covered by 
the survey. 
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APPENDIX A: CALCULATION OF THE 
COSMIC ERROR 

This section generalizes the calculation of the cosmic error 
by SC to the case of a non-uniform selection function, and 
for our estimator which includes local (spatial) statistical 
weight. 

Following CBSII and SC, equation (^) is separated into 
two parts, according to whether the cells overlap or not 

F — ^overlap ~t~ -^disjoint 7 

(Al) 

with 

^overlap = 4- / d 3 rid 3 r 2 . . . , (A2) 

V Jr 12 <2e 

^disjoint = i / d 3 r 1 d 3 r 2 ■ (A3) 

and ri2 = \ri — r%\. Except when specified, we assume that 
the cell size is small compared to the survey size. Our cal- 
culations will thus be valid at leading order in v/V . 



The r dependence (through TV) of quantity Tjv [defined by 
eq. (Ej)] is now explicitly written, thus 

Prx,r 2 {x,y) ~ P ri (x)P r2 (y)[l + Rr lt r 2 (x,y)] 

+ 0(e/f), (A5) 

OO 

Rr u r a (x,v) = t {x - 1) N (y - 1) M Q N+M 

M, JV=1 

rM(ri)r JV (r 2 )7VM, (A6) 

where £ = £(ri 2 ). The second approximation, hereafter BeS, 
was proposed by Bernardeau & Schaeffer (1992). It is 

Rr u r 2 (x,y)=T[(l-x)N(ri)£\ 

r[(l-y)7V(r 2 )C]C/f , (A7) 

where 

/ x - jv JV - 2 (W - 1) 
t(s)=s 2J2(s) n - 2 Qn ^ '-■ (A8) 

y n>2 

It was noted by SC that the two approximations SS and 
BeS, although quite different formally, give practically iden- 
tical results for the cosmic error on the factorial moments in 
realistic cases. Also, as shown by SC, in the case of a sample 
without selection effects and with homogeneous weighting 
(cj> = u) = 1), the relative finite volume error on Fk does not 
depend on N, neither on the average density. This means, 
as shown later, that in the more general case <j>(r) < 1, all 
^-dependent terms disappear in the final expression for the 
finite volume error on Fk. 



Al Contribution from Disjoint Cells 

The contribution to the error from disjoint cells corresponds 
to finite volume effects. It requires the knowledge of the gen- 
crating function Pr\,r 2 (x,y) of the bivariate counts in dis- 
joint cells at positions n and r 2 . The problem is that the 
average number density of galaxies depends here on the dis- 
tance r from the observer. According to the the reasoning 
of Szapudi & Szalay (1993a) in terms of Poisson processes 
[see, e.g., their eq. (5.2)], 



Pr 1 ,r 2 ( x ,y) 



expjj] 

N,M 



N\M\v N + M 



d 3 U\...d 3 UN I d 3 UN+l ■ ■ ■ d 3 UN+M 



£jV+A/(lti, . . . , Ujv+m)}, 



(A4) 



where A^ r is the average number of objects per cell expected 
at position r [eq. (g)]. There are two approximations in SC 
which are used for the bivariate generating function. We 
refer to SC for details about the underlying hypotheses in 
these approximations, which are particular cases of the hi- 
erarchical model (e.g. Peebles 1980, BS), and for those it is 
trivial to take into account the above supplementary depen- 
dence of the average densities N ri and N T2 on distance from 
the observer. The first approximation, hereafter SS, was de- 
rived by Szapudi & Szala y (1 993a, 1993b). It assumes that 



the integral in equation (A4) can be well approximated as 



N ri N T2 Qn+mT 'jv(ri)r M(rz)N M£ up to linear order in £/£. 



A2 Contribution from Overlapping Cells 

The overlapping contribution is discussed under the assump- 
tion that the variations of cj>(r) and uj(t) are small across the 
length 21. This is a reasonable approximation when the cell 
size £ is small enough. With the additional assumption of 
local Poisson behavior (SC) 



^overlap (■£ , y ) 



V Jy 



"^overlap 



N=N r 



(A9) 



where the quantity E^^^ (x, y) is the overlapping contri- 
bution to the generating function of the cosmic error com- 
puted by SC when there are no selection effects and the 
weighting function is unity. There is an r dependence ac- 
counting explicitly for the fact that the average number 
density ncj>(r) depends on the distance from the observer: 
^!vrfa p 1 ( 1 '!') i s calculated at N = N r . The overlapping 
contribution of the error can be thus inferred from the cal- 
culations of SC with a supplementary weighted integral. 



A3 Cosmic Error on Factorial Moments 

Let us now concentrate on the error on the factorial mo- 
ments, obtained from equation (|l4|). 

The finite volume contribution from disjoint cells is 



fA F F fc ) 2 



1 

fa 



d 3 rid 3 r2U>(ri)u)(r2)rj(ri, r 2 ) (A10) 
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with 

ij(ri,r 9 ) = [*(''i)0(ra)]~ afc [^]*[^]* 1 
{P ri (a; + l)P r2 (j/+l) 

Bp 1 ,r a (a: + l.y + l)}l»= 1 M)- ( A11 ) 

From equations (^) and (A6) or (A7), function r\ depends 
only on ri2, as follows 



rj(r 1 ,r 2 ) cc £(ri 2 ). 



(A12) 



Thus the selection effects cancel out in the finite volume con- 
tribution to the cosmic error on the factorial moments. The 
results of SC (see their § 4.3) can therefore be directly used 
here, just by replacing their £(L) = V~ 2 f v d 3 r\d 3 r2i{r\2) 
by the weighted average 



V 2 



ri,r 2 eV,r 12 >2e 



d i r 1 d J, r 2 'jj(r 1 )<jj(r2)i(ri2)- 
(A13) 



The quantity L stands for the effective cat alogu e size L = 
V 1 / 3 . At leading order to v/V the integral (A13) reduces to 



d 3 r 1 d 3 r 2 uj(r 1 )uj(r2)£,(ri2). 



(A14) 



For u) = 1, naturally f eff (L) — £(-£). Thus the finite volume 
error with the notation of S 3.2 is, 



2 ' Al =ul=1 F k \ 2 Lb (L) 



= a 2 Ji b{l) 



(A15) 



The calculation of the contribution (A ovcrlap F fc ) 2 from 
overlapping cells, the sum of the edge, and the discreteness 
errors (see § 3.2), is easier. Using the fact that N r oc <j>(r), 
Fk oc N k and that the relative edge effect error A| does not 
depend on N in the case cj> = cj = 1, it is straightforward to 
obtain equations (E9J) and (BQ). 



APPENDIX B: NUMERICAL CALCULATIONS 

Bl Cosmic Error and Minimum Variance Weight 

In this section the numerical solution of integral equation 
(p3|) is considered. Let us define the step A as 



A = (P m ax — Rmin) /Nbin, 

and the numbers 

Ti = P mi n + (l - 1/2)A, 1 = 1,..., Nbin 



(Bl) 



(B2) 



The solution is obtained in the space of step functions u> 
verifying 

w(r) =u>i, r£[r, - A/2, r, + A/2[. (B3) 
Equation (B3J) in its discretized form can be written as 



+ OLiLOi + Xi — 0, 

3=1 
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(B4) 



where 

S*i3 = 



r-i+A/2 prj+A/2 



£(L)V J r ._ A / 2 Jrj-A/2 

u 2 f2(it)u 2 f2(u)f (m, v)dudv, 

ri+A/2 

(Ae + Ao(ii)) u 2 Cl(u)du, 



i-A/2 
ri+A/2 



it 2 n(it)dii. 



A, = A 

/r-i-A/2 

Note that the cosmic error is simply 



(B5) 
(B6) 

(B7) 
(B8) 



The most difficult part in determining the optimal sampling 
vector (ui)i=i,...,iv bin is the calculation of the correlation ma- 
trix . A new binning is necessary for computing the dou- 
ble integral (B5). We proceed as follows: 



6 



[ulv 2 {l(uk)0.(vi)i(uk,vi)AukAvi 



2 ^ V ^ 
+ ulvfCl(uk)Cl(vi)i(uk,vi)Au k Avi] , 



(B9) 



where 
Auk = Avk 



A(N subbin - 1), 2 < k < N subbin - 1 
|A(A^ubbm - 1), k = 1 or A^subbin, 



A«t = Av k = A/N subbin , 

u k = n + (k - l)A/(A r subbii 

= rj + (I - l)A/{N subbin 

Uk = r, + (fc - l/2)A7t fc , 

vi = rj + (/ - l/2)Awj. 



(BIO) 



(Bll) 



£(m, v) = I P(fi\u, v)d/j,^ [\u 2 + v 2 — 2uvfi] 1 ^ 2 ^ 



The double summation in equation (B9) avoids calculation 
of function £(u, v) for u — v, where it might diverge (for ex- 
ample, if £(r) oc r -7 at small scales, then function £(u, u) di- 
verges when 7 > 2). It remains to compute function £(u, v), 
u 7^ v. In equation (|52|), one can define 9 as the angle be- 
tween the directions fi and r2: 

= 003(6*) = cos(<£i — 1P2) sin 6>i sin 62 + cos #1 cos 62- (B12) 

Equation (|52|) can then be rewritten as 



(B13) 



where P([i\u,v) is the probability distribution function of 
H, given u and v. (For a spherical catalog P(p,\u,v) = 1/2). 
Typically, if u and v axe large enough compared to P mm , the 
function P(fi\u, v) does not depend sensitively on the values 
of u and v. It increases with fi, from zero for — 1 < y, < /i m in, 
where /i m i n corresponds to the largest effective angular size 
of the catalog, to some maximum at n = 1, P max , depending 
on the angular size of the catalog. For our SDSS-like cata- 
log, Pmax is of order 2. The calculation of P(fi\u,v) is done 
numerically for a discrete set of values of (fj,i, rj, r k ), 1 < i < 

A^, < j, k < JVbin + 1 and r = Pmin, rAr Wn + l = Pmax- 

For each value of (j,k), a Monte-Carlo simulation is done, 
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i.e., directions fj and r& are randomly chosen such that the 
corresponding cells are included in the catalog. A reasonably 
accurate calculation, with a few percent absolute errors re- 
quires iVjtcr ~ 100, 000 iterations for = 100. For a given 
value of (fi, u, v), the estimate Pnterpoi Qu, U, v) of P(/i, u, v) is 
obtained from a bilinear interpolation between P(fii,rj, r k ), 
P(jM,rj+i,rk), P(fH,rj,rk+i), P(^i,r j+1 ,r k +i) where fn is 
as close as possible to fi, and rj < u < rj+i, r k < v < rjt+i. 
By definition, we take P(fJ>i,rj,rk) = if j = or k = 0. 

We use the same Monte-Carlo simulation as above (ex- 
cept that we use only one direction fj) to compute the effec- 
tive solid angle on an array Cl(rj), and then proceed with lin- 
ear interpolations to compute estimates f2mter P oi(M) of fi(it) 
at u Tj. 



To compute the angular average (B13) the following 
variable is defined 
1 



(7* — 2)uv 



I 2 . 2 

iu+v 



2uv/i 



l— r,/a 



(B14) 



where 7z ~ 7 is the expected power-law index of the two- 
point correlation function at small scale, or some value close 
to it. With the definitions fi(z) = z" 1 (z(fx)) , z m — z(fi = 

-1), Z M = z(pL = 1), 



Azi = 



(zm - z rn )/(N z - 1), 



2 < i < N z - 1, 



(B15) 
(B16) 



±0m - z m )/{N z - 1), i = 1 or AT Z , 

Zj = + (l - 1)(ZM - Z m )/(N Z — 1), 

the correlation matrix can be written as 

N~ 

i(u, v) ~ ^ [™( 7z - 2)^] W(2 " 72;) Pnterpoi (M^),", «) 
i=l 

C([™( 7z -2) 2l ] 1/(2 -^') A*. (B17) 
The calculation of on follows naturally from 

&i = ^ (Aedgc + Arj(ufc)) ltfenint C rpol(wfe)Allfe, (B18) 



and one proceed similarly for A^: 

Aj = A Ufcf2interpol(w&)Awfc. 



(B19) 



fc=l.JV.ubbta 

The numerical estimate of the effective sample volume reads 



V 



1 



V 

i — 1 



[(fi(r i+1 ) - fi(ri)) (rt-x-rf) /4 



+ (n(ri)ri+i - n(r i+ i)n) (rf +1 - rf) /3] . (B20) 

The same integration scheme is used to normalize the weight 
[eq. (0)]. To compute the sample volume V of our SDSS 
like catalog, a more accurate estimator is used, with exact 
calculation of fi(r) and an integral on variable r using the 
trapezoidal method with 100, 000 points. 

For the calculation of the terms A|, A| and Aq, we 
use the results of SC corresponding to the SS approxima- 
tion and to 7 = 1.8 where 7 is the assumed logarithmic 
slope of function £(r) for computing the numerical coeffi- 
cients in the different terms contributing to the cosmic error 
[see equations (53) to (68) in SC]. Note that, as discussed 
in SC, these numerical coefficients are quite insensitive to 
7 if it stays reasonably close to 1.8. Therefore the same ex- 
pression is applied for the cosmic error as a function of Qn, 



N and £ even if 7 7^ 1.8 or if function £(£) is not exactly a 
power-law, like in the CDM case. 

B2 Accuracy Tests 

The difficulty in obtaining accurate results for the optimal 
weight relies mostly in the estimation of the correlation ma- 
trix Here, we assume that the finite volume error domi- 
nates over the edge effect and the shot noise errors, although 
we performed extensive accuracy tests including all sources 
of error. 

In what follows, we study the optimal weight u)(r), 
which depends only on the shape of the two-point corre- 
lation function (and thus not on higher order statistics). 
We consider a spherical sample, where there is no need in 
principle to compute numerically P(fi\u, v) as straightfor- 
wardly P(fi\u, v) = 1/2. Furthermore, the two-point corre- 
lation function is assumed to be a power law of index —7 
[eq. ([33j)], which allows the analytical computation of the 
correlation matrix f^: 

fa =|^t [F(r i + A/2,r j + A/2) 



t{L)V 



with 



F(u,v) 



+ F(n-A/2, rj - A/2) 
-F(n + A/2,rj-A/2) 
- F(n - A/2, rj + A/2)] (B21) 



(2- 7 )(3- 7 )(4-7)(6-7) 

[(6 — 7)™ { |tt — u| 4 ~ 7 + (it + w) 4 ~ 7 } 

+ {| tt _ w |6-7_ (M + „ ) 6- 7 |j _ (B22) 



and 

4 .3 
V = -7r_R max . 

Therefore 

A, = 4^r 2 A [ 1 + ^-5 



(B23) 



(B24) 



In figure Bl, the minimum variance weight is shown as 
a function of distance from the observer, obtained from 
three different calculations, each with JVbm = 50. The solid 
curve corresponds to the analytical result [eqs. (B21) and 
(B24)]. There is a dashed curve almost perfectly super- 



posing to the solid one, except from the extrem e lef t part 
of it. In that case, equations (B9), (B19) and (B17) with 
P(/x|u, «)interpol = 0.5 were used to compute £jj and Ai. We 
have taken 2V su bbin = 30 and N z = 50 (as for all the figures of 
the main text, when it is relevant). For both the solid and the 
dashed curves, there is an irregularity at each end. The left 
irregularity, corresponding to the weight given (nearly) at 
the center of the catalog, is due to the limitations of the dis- 
crete approach. In the continuous limit, the function u>(r) is 
expected to behave smoothly in the neighborhood of r — 0. 
The right irregularity is more difficult to understand, and 
will be discussed later. The fluctuating dotted curve is the 
same as the dashed one, but we computed the probability 
distribution function P(/x|m, v) by Monte-Carlo simulation 
as explained in § B.l (with Mtcr = 100,000 iterations for 
iV M = 100). The left irregularity for this is quite dramatic, 
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Figure Bl. The minimum variance weight u) is shown as a func- 
tion of distance R from the observer in the case the finite vol- 
ume error is dominant (the results displayed here correspond to 
i = 10 Mpc, but would not change significantly for other val- 
ues of £). For the finite volume error, u does not depend on the 
order k. The two point function is assumed to be a power-law 
£(r) oc r The calculations were performed assuming spherical 
geometry, except for the long dashes which show the result for a 
SDSS-like geometry. The solid curve corresponds to the full ana- 
lytical calculation of the correlation matrix Hi j. A dashed curve 
is almost perfectly superposing to the sol id one. In th at ca se, the 
numerical scheme given by equations ( |B9[ ), (B19) and (B17) (with 
Interpol (A t l u ) ") = 0.5) has been used. The dotted line shows the 
result as calculated via a Monte-Carlo simulation plus bilinear 
interpolation to compute -Pintcrpol(^l n i v). The long dashes, cor- 
responding to a SDSS geometry, were obtained the same way as 
the dots. 



as there is one value of the estimated weight which is nega- 
tive. Note, however, that the calculation of the finite volume 
error is fairly robust with respect to the weight: the weights 
given by the solid, the (invisible) short-dashed or the dotted 
curve give exactly the same result. The long-dashed curve 
is the same as the dotted one, expect that it corresponds 
to an SDSS like geometry covering thus one quarter of the 
sky. The irregularities are less pronounced in that case, due 
to the anisotropy of the distribution P(/x|«, v) of values of n 
(see also the solid curves of Fig. 

Figure |b^ shows the effect of changing iVbin when calcu- 
lating the optimal weight. There are six curves, correspond- 
ing respectively to N bin = 6, 12, 25, 50, 100 and 200. The 
method used for the calculation is the same as the (almost 
invisible) dashed line in figure Bl. Except for the right and 



the left end-points of the curves, resolution is not critical for 
determining w(r). The left irregularity on each curve was 
shown to be unphysical and due to the numerical limita- 
tions of our calculation. The right one is more of a concern, 
since the estimator of log 10 [u;(-R m ax)] increases linearly with 
AW, suggesting that the actual optimal weight is singular 
at -Rmax- As a consequence, A/bin has to be large enough 
to resolve this singularity sufficiently. The cosmic error can 
be sensitive to the singularity, especially if the order k is 
large and if the finite volume error is significant. In realistic 
cases, however, the two-point correlation does not behave 
like a power-law of index —7 = —1.8 up to arbitrarily large 
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1000 



Figure B2. Same as figure [Bl| except that the sensitivity of the 
results to the value of JVbm are tested for iVbin = 6, 12. 25 . 5 0. 100 
and 200. The numerical scheme given by equations ( |B9[ ), (B19) 
and (B17) (with F; nterpo i(/i|n, v) = 0.5) was used. 



scales, thus the singularity is expected to be less pronounced 
or to disappear. For example, in the CDM case, the effect is 
less significant, although there is still a slight instability at 

R max 

(see Fig.0). 



B3 The optimal sampling rate 

The methods for generating figures |^ and ^ will be explained 
in this section. To simplify the calculations and since it was 
found in § 4.2 to be a good approximation, we take for the 
optimal weight ui the expression given by equation (p4"|). To 
estimate the cosmic error from equation (B8), the matrix 



and the vector &i have to be computed for rather large 
values of J? ma x in order to scan a large enough range of the 
sampling rate / [see eq. (§)]. Solving the implicit equation 
for minimizing the cosmic error as a function of the sam- 
pling rate by recalculating the matrix £ij for each value 
of / would be too costly from the computational point of 
view. Instead, we calculated for i? ma x = 5000 Mpc and 
with A/bin = 200. A given value of / corresponds to some 
fl max (/) and thus roughly to some value of r^. This is taken 
to be the as close as possible to i? ma x(/)- To compute the 
cosmic error corresponding to some value of /, quadratic 
interpolation is performed between the values obtained for 
-R m ax(/) = r *-ii -Rmax(/) = n and _R m ax(/) = n+i- Even 
this smoothing procedure cannot guarantee smoothness for 
/ as a function of scale, although the calculation is valid 
within a few percents. The implicit equation for finding the 
minimum of function Aj? osmic (/) is easy to solve by bisection 
as this function is convex (see figure 



APPENDIX C: ASYMPTOTIC REGIMES 

Using the results of SC (valid at leading order in v/V), this 
section presents an analytic estimate of the optimal sam- 
pling rate / for a three-dimensional galaxy catalog (§ 5.1 
and 5.2). This is possible in the highly and weakly nonlinear 
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regimes, where £ S> 1 and ( < 1, respectively. According to 
the results of § 5.1, uniform selection <j> — 1 can be assumed. 

From the calculations of SC, the cosmic error on the 
factorial moments of order k < 3 in the highly nonlinear 
regime is (using the SS approximation, see § A.l) 



suming that edge effects dominate, equations ( p^ ) and ( |75| ) 
naturally follow. 

Note that if finite volume effects dominate edge effects 
in the regime £ <C 1, and AT 3> 1, the optimal sampling rate, 
with the notations of S 5, becomes 



cosmic-* 



■"cosmic- 1 



F 2 



■ Ft 

QIC -J 



F 3 



where 



+ 5.51 



0.5 



38.8 1 

+ Qi n c + 



(CI) 



(C2) 



0.19 1 

185Q 6 

QI 



+ 



4.93Q4 



(C3) 



1-3C (L 



( L IC { 



(C7) 



where £ = 1/5 or 1/9 according to whether the redshifts 
are collected individually (§ 5.1) or collectively (§ 5.2). For 
the same choice of the parameters as in § 5, we find / ~ 
1/20 for individual collection of redshifts and / ~ 1/10 for 
simultaneous collection independently of scale and of the 
order k. As a result, the general conclusions of our analysis 
in § 5.1 and 5.2 are not changed, even if finite volume effects 
dominate over edge effects. 

This paper has been produced using the Royal Astronomical 
Society/Blackwell Science IATjTjX style file. 



N C = N £ 



(C4) 



is the typical number of galaxies in a cell located in an over- 
dense region (e.g. BS). At small enough scales and if 7 < 3, 
it is possible that iV c <C 1 irrespectively of /. In that case, 
count-in-cells statistics is dominated by discreteness effects 
(BS). The edge effect error, corresponding to the term pro- 
portional to £v/V and independent of N c , is negligible com- 
pared to the shot noise error. Equations ( |m| ) or ( fri]) follow 
straightforwardly. 

In the weakly nonlinear regime, where ^ <gC 1, the Gaus- 
sian limit at lea ding order in £ is a good approximation. The 
expression (CI) for the error on Fi is still valid. At higher 
order, however, 



cosmic 



Fz 



Fz 



4£(L) + 17.05£^ 

+ (0.65 + 4iV)i^, 
N V 



(C5) 



+ (0.88 + 5.83iV + 9iV 2 ) —^.(06) 



If the scale is large enough, edge effects are expected to 
dominate over the finite volume errors, but this property 
depends on the details of the underlying statistics and on 
the geometry of the catalog. More rigorous estimates of the 
cosmic error, taking fully into account the geometry of the 
catalog would be needed to prove it, a level of accuracy out- 
side of the scope of this paper. Exact calculations of the 
cosmic error when the size of the cell becomes comparable 
to the size of the catalog are indeed quite tedious (see for 
example Appendix B of SC). Although our approach is valid 
only t o lea ding order in v/V , setting v/V = 1 in equations 



(CI), (C5), and ( |C6| ) yields at least a lower limit for the rel- 
ative contribution of edge effects compared to finite volume 
effects. The result is that the edge error is a few times larger 
than the finite volume error (for a spherical catalog). As- 
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